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0.0  Introduction  and  Overview 

This  report,  in  the  form  of  a  set  of  notes,  details  Honeywell's  research 
results  of  the  past  year  in  Robust  Multivariable  Control  Theory.  These  notes 
are  made  up  of  four  major  parts.  Part  0  gives  a  review  of  the  required  nota¬ 
tion  and  mathematical  background.  Part  1  reviews  recent  results  on  the 
problem  of  analyzing  the  performance  and  robustness  properties  of  sys¬ 
tems.  Part  2  presents  the  results  on  synthesis  which  are  the  highlight  of  this 
report,  and  Part  3  outlines  how  the  methods  of  the  previous  parts  apply  to 
control  of  large  space  structures. 

In  the  context  of  these  notes,  the  words  analysis  and  synthesis  have 
specific  meanings.  Analysis  is  used  to  describe  the  process  of  determining 
whether  a  given  system  has  the  desired  characteristics.  In  general,  this  may 
range  from  the  use  of  mathematical  tools  to  simulation  to  experimentation, 
although  analysis  is  typically  applied  primarily  to  describe  the  former.  Syn¬ 
thesis,  on  the  other  hand,  is  the  process  of  finding  a  particular  system  com¬ 
ponent  to  achieve  desired  characteristics,  which  are  typically  expressed  in 
terms  of  some  analysis  tools.  Analysis  and  synthesis  are  just  two  aspects  of 
the  more  general  problem  of  engineering  design- 

The  results  reported  in  these  notes  represent  a  significant  advancement 
in  the  state-of-the-art  of  robust  multivariable  control  design.  In  addition  to 
providing  specific  mathematical  results,  these  notes  describe  a  general 
approach  to  control  problems  which  is  intended  to  provide  the  foundations 
for  a  new  paradigm  for  control  theory  broader  in  scope  and  content  than 
what  has  been  previously  available.  This  paradigm  is  introduced  in  Part  1  on 
Analysis.  An  important  aspect  of  this  new  paradigm  Is  the  treatment  it  gives 
to  model  uncertainty. 


Modern  Control  Theory,  the  dominant  paradigm  for  the  past  20  years, 
has  its  basis  in  Stochastic  Optimal  Control  and  Estimation  Theory  [LQG],  This 
theory  essentially  restricts  model  uncertainty  to  additive  noise.  The  theory 
provides  a  methodology  for  analyzing  the  impact  of  noise  on  system  perfor¬ 
mance  and  synthesizing  to  reduce  that  impact. 

The  inadequacies  of  this  view  of  uncertainty  became  widely  accepted  in 
the  late  1970' s,  as  robustness  to  plant  uncertainty  became  a  major  theme  in 
the  Modern  Control  Theory  community  [LMC].  Ironically,  this  involved  a 
renewed  interest  in  the  Classical  Control  paradigm  ([Bod],[Hor]),  which 
Modem  Control  displaced  within  the  theoretical  community  (if  not  among 
practicing  engineers).  This  new  direction  provided  useful  design  tools,  includ¬ 
ing  Singular  Value  Analysis  and  Multivariable  Loop  Shaping 
([DStlJ.[Doyl].[DS«]). 

While  providing  an  important  perspective,  as  well  as  practical  tech¬ 
niques.  the  methods  based  on  singular  values  still  require  rather  restrictive 
assumptions  about  uncertainty.  In  particular,  plant  uncertainty  must  essen¬ 
tially  be  modelled  as  a  single  "unstructured  perturbation" 

The  Structured  Singular  Value  (SSV),  n,  was  developed  several  years  ago 
to  correct  this  deficiency  in  singular  values  ([Doy2],[DWS]).  In  the  context  of 
the  general  framework  discussed  in  this  memo,  the  SSV  provides  a  very 
powerful  mathematical  tool  for  the  analysis  of  complex  systems.  Indeed,  we 
believe  that  this  framework  together  with  the  SSV  and  the  synthesis  tech¬ 
niques  discussed  later,  has  the  potential  to  form  the  basis  for  a  new  para¬ 
digm  for  control  theory.  Part  1  of  these  notes  describes  the  general  frame¬ 
work  for  control  system  analysis  and  synthesis  which  includes  all  the 
viewpoints  discussed  as  special  cases.  In  particular,  the  assumptions  about 


3 


uncertainty  required  by  each  methodology  are  compared. 

In  Part  1  it  is  shown  that  each  analysis  methodology  boils  down  to 
evaluating 

KIL  “=2.»  or  (jl  (1) 

for  some  transfer  function  Pq.  Thus  when  the  controller  is  put  back  into  the 
problem,  it  involves  the  synthesis  problem 

min  j  jfj (P, K)  j  #  for  0=2,"®,  or  fi  (2) 

subject  to  internal  stability  of  the  nominal.  Here 

Ft(P,K)=Pn  +  PM-PafO-'P^. 

The  solution  of  this  problem  for  a-2  and  »  is  the  focus  of  Part  2  on  Syn¬ 
thesis  Theory.  The  solution  presented  there  unifies  the  two  approaches  in  a 
common  synthesis  framework.  The  0*2  case  was  already  known  and  the 
results  are  simply  a  new  interpretation.  The  a=«  case  had  been  solved  only 
for  special  cases  where  P12  and  P2\  are  square.  Also,  the  existing  solutions 
did  not  have  computational  schemes  allowing  their  use  on  even  moderately 
sized  problems.  These  two  limitations,  especially  the  former,  restricted  the 
application  of  the  pioneering  Hm  methods  to  fairly  simple  problems,  such  as 
sensitivity  minimization.  The  new  solution  presented  in  Part  2  eliminates 
these  two  limitations. 

Unfortunately,  this  new  solution  for  the  H2  and  Hm  suffers  from  the 
same  limitations  imposed  by  restrictive  assumptions  about  uncertainty  as  do 
the  underlying  analysis  method*.  While  the  SSV  is  a  great  improvement  for 
analysis,  synthesis  for  the  case  is  not  yet  fully  solved.  It  is  shown  in  Part 
1  that  n  may  be  obtained  by  scaling  and  applying  jj*jj—  so  a  reasonable 
approach  is  to  "solve” 


(3) 


mri!l|PFJ(P.-/0^'l|l|, 

by  iteratively  solving  for  K  and  D.  With  either  K  or  D  fixed,  the  global 
optimum  in  the  other  variable  may  be  found  using  the  n  and  Hm  solutions 
described  in  Parts  1  and  2,  respectively.  Example  designs  have  been  done 
and  this  scheme  seems  to  work  well,  but  global  convergence  is  not 
guaranteed.  In  fact,  a  counterexample  has  been  constructed  where  (3) 
reaches  a  local  minimum  which  is  not  global.  11113  is  the  subject  of  ongoing 
research. 

As  noted  above,  the  main  results  of  these  notes  are  the  synthesis  solu* 
tions  of  Part  2,  particularly  for  the  Hm  case.  This  part  focuses  on  the 
mathematical  problem  of  finding  K  e  ^*xm*  such  that 

Ti(P,K)  e  (4) 

and 

U*y  (s) 

where 

€  .  pv  c  (P) 

and  7  are  given  and 

F,(P.A 0  =  Pu  *  PitK(I -PafO-'Pzy  (7) 

The  important  mathematical  questions  include  finding  for  which  7  solu¬ 

tions  exist  and  then  parametrizing  ail  solutions  to  (4)  and  (5).  The  notation 
used  here  is  that  Rp  denotes  proper,  real-rational  and  RHm  denotes  the  sub¬ 
set  of  Rp  analytic  in  the  right-half  of  the  complex  plane.  The  above  problem 
is  a  natural  generalization  of  the  standard  (rational)  matrix  "best  approxima- 


P  - 


Pu  Pit 
[P  21  Pa 


\Ti(P.K) 


tion"  or  "generalized  interpolation"  problem.  The  relationship  between  (4)- 
(5)  and  these  familiar  problems  will  be  discussed  in  Part  2  of  these  notes 
which  presents  an  approach  to  solving  (4)-(5). 

A  Important  issue  in  applying  mathematical  results  of  the  type 
developed  in  these  notes  to  engineering  problems  Is  the  ability  to  generate 
algorithms  that  compute  the  solutions  In  a  reliable  and  efficient  manner. 
These  notes  pay  close  attention  to  this  issue  by  developing  the  mathematics 
in  a  way  that  makes  algorithm  development  reasonably  straightforward.  This 
is  often  done  at  the  expense  of  elegance.  It's  worth  noting  that  virtually  all 
the  results  in  these  notes  have  been  implemented  in  computer  software  and 
that  this  software  is  currently  being  used  to  do  experimental  designs  for  non¬ 
trivial  engineering  problems.  From  a  practical  engineering  point  of  view,  the 
results  have  been  most  encouraging. 

The  remainder  of  this  part  of  the  notes  simply  defines  notation  and 
reviews  some  of  the  specialized  methods  which  are  standard  within  the  con¬ 
trol  theory  community  but  are  not  typically  well-known  to  mathematicians. 


0. 1  Definitions  and  Notation 


0.1.0  Notation 


STMBOL 

: - 1 

USAGE 

. 

:  : 

=*!«<'> 

: 

1.  (z*y)(t)  :=  f  x(t  -r)y(r)dr 

* 

. 

2.  A*  =  complex-conjugate  transpose  of  complex  matrix  A 

i 

3.  7*  =  adjoint  of  operator  7 

U  ,  6 

1 

l+_(f )  =  unit  step  function; 

5(f)  =  unit  impulse 

s 

C(s)  =  two-sided  Laplace  transform  of  g{t) 

1 

orthogonal  complement 

B  i 

?(A)  =  largest  singular  value  of  matrix  A 

i 

p 

p(A)  -  spectral  radius  of  matrix  A 

2 


YX  =  TAr-1  .  A  =  diag(i,....ln) 

Then  columns  of  T  are  (possibly  nonunique)  eigenvectors  of  YX  correspond¬ 
ing  to  the  eigenvalues  jA*j.  It  is  shown  in  Lemma  1  at  the  end  of  this  section 
that  YX  has  real  diagonal  Jordan  form  and  that  A^O.  This  is  a  consequence  of 
Tk 0  and  A'aO. 

Although  the  eigenvectors  are  not  unique,  in  the  case  of  a  minimal  reali¬ 
zation  they  can  always  be  chosen  such  that 

?  =  TYT’  =  E. 

£  =  =  E, 

where  E  =  diag(ffj,ff2 . ffn)  and  E2  =  A.  This  new  realization  will  be 

referred  to  as  a  balanced  realization  (also  called  internally  balanced)  [Moo], 


Suppose  G  =  ^cJ3  *s  a  balance^  realization  for  G  and  can  be  parti¬ 
tioned  as 


An  A 12 

Bi 

Aai  Agz 

Bz 

Ci  c2 

D 

with  corresponding  partitioning  of  the  balanced  gramian  E  =  |q  .  Sup¬ 
pose  Et=diag(ffi,a2,...ffr).  Ej=diag(<7r+1,ffr+2 . cn)  and 

•  ■  ■  fcer><7r+1St(jr+2S:  ■  Then  it  is  immediate  that  the  truncated 

system 

An 

=  C,  D 


h 


is  balanced  since 


0.3  Gramians  and  Inner  Transfer  Functions 


0.3. 1  Gramians  and  Balanced  Realizations 


Suppose  G= 


C\D 


where 


is  stable. 


Define 


the 


controllability  gramian  Y  as 


F  k  f  eM BB'iAt dt 
0 

and  the  observability  grojriixm  as 


Xk  fe^C-Ce^dt. 

0 

By  considering  the  corresponding  matrix  differential  equations  it  is 
easily  shown  that  Y  and  X  satisfy  the  Lyapunov  equations 

AY  -  YA’  -  BB'  =  0 
A'X  +  XA  +  CC  =  0 

Note  that  F>:0  and  AiO.  Furthermore,  the  pair  (A,B)  is  controllable  i ff  Y> 0 
and  ( C,A)  is  observable  iff  A>0. 

Suppose  the  state  is  transformed  by  nonsingular  T  to  £  =  Ts  to  yield  the 
realization 


\a  \§ 

A  S 

TAT1  TB 

d  j  D 

with 

d  D 

cr1  D 

Then  the  gramians  transform  as  ?  =  TYT  and  X  =  (7‘“,)’AT“1.  Note  that 
YX  =  TYXT~l  so  the  eigenvalues  of  the  product  of  the  gramians  are  invariant 
under  state  transformation. 

Consider  the  similarity  transformation  T  which  gives  the  eigenvector 


decomposition 


0.2.4  linear  Matrix  Equations  : 


Property  1 :  (Solution  of  Sylvester  Equations) 

Consider  the  Sylvester  equation 

AX  ~  XE  =  C  (1) 

where  /  G  KnKn,  B  C  IR'1*’",  C  G  Kn*m  are  given  matrices. 

Then,  there  exists  a  unique  solution  X  G  IRn*m  if  and  only  if 
#0.  Vis  l . ,«  and  j  =  1 . ,m. 

Remark : 

In  particular,  if  i?  =  .  (1)  is  called  the  "Lyapunov  Equation”  and  the 

necessary  and  sufficient  condition  for  the  existence  of  unique  solution  will  be 
that  Re[A*(.4)  +  +  0.  V  i,j  s  i . n. 


Property  2 ;  (Solution  of  linear  Equations) 
Consider  the  linear  equation 

A  X  =  B 

where  A  G  lRnxn ,  B  G  IR"xm  are  given  matrices. 
The  following  statements  are  equivalent : 

(i)  there  exists  a  solution  X  G  Hnxm . 

(ii)  the  columns  of  B  G  Range  ( A ) . 


(iii)  ran* 


A  B  |  *  rank  A 


(Iv)  Ker  (At)  C  Ker  (BT) 


The  following  lemma  characterizes  the  relationship  between  zeros  of  a 


transfer  function  and  poles  of  its  inverse. 


B.  Lemma  Suppose  G  - 
(s0 ,  x0 )  such  that 


U  If 


with  D  nonsingular.  Then  there  exists 


( A  -  BD^C)!,  -  s„zt  , 
iff  there  exists  u,  #0  such  that 

G<s0K  =  0 


Proof 

(if) 

C(s0)u,  =  0  implies  that  CT^s)  has  a  pole  at  s„ .  Thus  =  (s0.  x,)  such 
that  Gc0  *0  and 

{A-BD~lC)z,  =  s0x. 


(only  if) 


Proof:  The  right  inverse  case  will  be  proven  and  the  left  inverse  case 


follows  by  duality.  Suppose  DD'  =  /.  Then 


=  / 

Corollary  7  Suppose  is  a  right  inverse  for  D  and  let 

G  = 


A-BD'C 

-BZ 

D'C 

Z 

Then 

CS  =  DZ. 

Corollary  T  Suppose  D'  is  a  left  inverse  for  D  and  let 

s  = 


A-BD'C 

- BD' 

ZC 

Z 

Then 


» 

A  BD'C 

BD' 

• 

G(?  = 

0  A-BD'C 

-BD' 

C  DD'C 

DD' 

A  BD'C 

BD' 

1 

0  A-BD'C 

-BD' 

[c  c 

r 

r 

* 

. 

/  j 

1  I7  71"1  (7  ~J 

Conjugating  the  state  by 

0  1 

on  the  left  and  U  A  -  U  j 

on  the  right 

» 

yields 

A  0 

0 

■  .. 

G&  = 

0  A-BD'C 

-BD' 

[c  0 

/ 

A\B 

C\D 


4.  Output  Injection 

x  =  Ax  -  Bu  -*  £  =  AS  -r  Bu  -  Hy 


I  H 

A  \B 

A+HC 

B+HD 

0  I 

C  D 

C 

D 

5.  Transpose  (Dual) 


G  -»  Gr 


a\b 

\at 

CT 

C\D 

*  m 

~D7, 

6.  Conjugate 


G  -*  G* 


A 

B 

-Ar 

- cr 

C 

D 

Bt  j 

dt\ 

7.  Inversion 


Suppose  D*  is  a  right  (left)  inverse  of  D.  Then  &  - 


A 


a  right  (left)  inverse  of  G. 


Suppose  G{s)  is  a  real-rational  transfer  matrix  which  is  proper,  i.e.,  ana¬ 
lytic  at  s=«.  Then  there  exists  a  state-space  model  ( A,B,C,D )  such  that 


The  quadruple  ( A.B.C.D )  is  called  a  realization  of  G.  A  realization  is 
minimal  if  A  has  minimal  dimension.  It  is  a  fact  that  a  realization  is  minimal 
if  and  only  if  ( A,B )  is  controllable  and  (C.A)  is  observable. 


A  basic  object  of  study  will  be  the  transfer  function  and  it  will  be 
assumed  to  have  a  realization.  The  next  section  describes  standard  opera¬ 
tions  on  linear  systems  in  terms  of  transfer  functions  and  their  realizations. 


0.2.2  Transfer  Functions 


Consider  the  linear,  time-invariant,  ordinary  differential  equation 
described  by 

x  -  As  -  Bu  (1) 

y  =  Cx  *  Du 

where  x(t )  e  IR  *  is  the  state,  u(t )  e  E  m  is  the  input,  and  y(t )  e  E  v  is  the 
output.  The  A,B,C,  and  D  are  appropriately  dimensioned  real  matrices. 

Associated  with  (1)  is  the  convolution  equation 

y(t)  =  (p*u)(0 

p(t)*  C**BU(t)  *  D6(t)  (2) 

and,  upon  taking  Laplace  transforms,  the  resulting  transfer  function  is 
y(s)  =  G(s)u(s) 

G(s )  =  C(sI-A)~'B  +  D  (3) 


To  expedite  calculations  involving  transfer  functions  the  notation 


a\ 

B 

cl 

D 

will  be  adopted.  Note  that 


A  B 
C  D 


C(sI-A)~lB  +  D  (4) 

is  a  real  block  matrix,  not  a  transfer  func¬ 


tion.  The  product  of  two  transfer  functions  is,  of  course,  the  cascade  of  the 
two  systems  or  just  the  multiplication  of  two  rational  matrices.  The  conven¬ 
tion  will  be  adopted  that  the  product  of  a  matrix  and  a  transfer  function  is  a 
transfer  function  defined  as 


Xu  X,g 

A 

li* 

xuA+xl2c 

XuB+X^D 

Xzi  Xz2 

pi 

X2]  A  r  Xj2  C 

X2lB-XaD 

(5) 


A  similar  convention  bolds  for  right  multiplication  by  a  matrix. 


The  system,  or  the  pair  (C* 4),  is  observable  if,  for  every  fi>0,  the  function 


y(t).  t  € 


Q.t  i ,  uniquely  determines  the  initial  state 


Theorem  1‘: 

The  following  are  equivalent; 
(i)  (C.A)  is  observable. 


(Si) 


The  matrix 


C 

\CA\ 

C4* 


has  independent  columns. 


(iii)  The  matrix 


\A-\A 

c 


has  independent  columns  for  all  X  in  C. 


(iv)  The  eigenvalues  of  A+HC  can  be  freely  assigned  by  suitable  choice  of  H. 

(v)  (A'.C)  is  controllable. 


The  system,  or  the  pair  (C.A),  is  detachable  if  A+HC  is  stable  for  some 
H. 


Theorem  Z: 

The  following  are  equivalent: 
(i)  (C.A)  is  detectable 


(ii) 


The  matrix 


has  independent  columns  for  all  Re  X>0 


(iii)  (i4',C)  is  stabilizable. 


0.2  Linear  Systems 

0.2.1  Controllability  and  Observability 

Consider  the  system 

x  =  Ax+Bu  ,  x(0)  =  0.  (1) 

The  system  or  the  pair  (A, 3)  is  controllable  if.  for  each  time  fi>0  and  final 
state  xJt  there  exists  a  (continuous)  input  u(*)  such  that  the  solution  of  (1) 
satisfies  x(f  ^  =  xv 

Theorem  1 

The  following  are  equivalent: 

(i)  ( A.B )  is  controllable. 

(ii)  The  matrix  [s,  AB.  has  independent  rows. 

(iii)  The  matrix  jd-X/,  f?j  has  independent  rows  for  all  X  in  C 

(iv)  The  eigenvalues  of  A+BF  can  be  freely  assigned  by  suitable  choice  of  F. 

The  matrix  A  is  said  to  be  stable  if  all  its  eigenvalues  satisfy  ReX<0.  The 
system,  or  the  pair  (A,B),  is  stabiiizable  if  there  exists  an  F  such  that  A+BF 
is  stable. 

Theorem  2 

The  following  are  equivalent: 

(i)  (A.B)  is  stabiiizable. 

(ii)  The  matrix  ji4-X/,f?j  has  independent  rows  for  all  ReXiO. 

We  will  now  consider  the  dual  notions  of  observability  and  detectability 


with  the  system 


2 


Lm(jRXmxn):  Banach  space  of  (essentially)  bounded  matrix-valued 

functions,  with  norm 


j|/’[|.  :=  ess  sup 


Hm(jTR,Cm*n ):  subspace  of  functions  F(s)  analytic  and  bounded  in  Re  s>0. 

Pf/t .  PHg  the  orthogonal  projections  from  LgORC"**")  onto 

//gO'R.CmXn)i-  respectively. 

Prefix  R  denotes  real-rational  and  the  prefix  B  denotes  the  unit  ball. 
The  symbol  R^mxn^  denotes  proper  real-rational  matrices.  Sometimes  the 
spaces  are  abbreviated  as  Ig(IR),  etc.  or  as  La,  etc.  when  context  determines 
the  arguments. 

The  Fourier  transform  yields  the  following  [isometric]  isomorphisms: 

£g(lR.  C"1*")  3  Z,8(jR  Cm*n) 

H2( 3R.  CmKB)  =  HaW R.  C"1*") 

H2(K  CmXn)iS  ffg(jR  Cw,w)i: 

The  norms  on  these  spaces  are  all  denoted  by  in 

A  useful  fact  is  that  the  norm  of  a  matrix  C  in  Lm(jTR,  Cm*n)  equals  the 
norm  of  the  corresponding  multiplication  operator 


/-*£?/:  igO'R  Cn)  -*  LafjTR,  Cm); 


that  is, 


G  m  =  sup  G/  \z-.f  elgO'R  C"),  \'\f  Ul. 


It  also  equals  the  norm  of  the  operator  restricted  to  H2(jlR,  CB): 


0.1.1  Function  Spaces 


Continuous  time  domain 

i/2(F-,Cmxn):  Hilbert  space  of  matrix-valued  functions  on  IR,  with  inner 
product 

</.$>:=  f  traee|/  (f  )*p(f  )]df. 

/f2(IR,C’n*n):  subspace  of  functions  zero  for  t  <0. 

j?2(lR,Cmx’‘)l  :  subspace  of  functions  zero'  for  f  >0. 

Pnt  and  PH i:  the  orthogonal  projections  from  i2(R,Cm>m)  onto 
/f2(K.Cm>m).  H2(KCmm)^  respectively. 

Continuous  frequency  domain 

;  jTR:  imaginary  axis. 

Lz(j R,cm*B):  Hilbert  space  of  matrix-valued  functions  on  j IR,  with  inner 
product 

<r-G>  -  ±rb  ace  [/W  <?(;«■>)]  do. 

Nz(J^R,Cmxn):  subspace  of  functions  F(s)  analytic  in  Re  s>0  and  satisfying 

su tracej^Xo  +  j o)*F{o+j u)Jdi>  <  » 


X,,D,  2,X,,'  ^  ^ \  ~  0 

An'S,  +  EiA,,  +  Cx'Ci  -  0. 

It  can  also  be  shown  (Silverman  and  Pemebo)  that  a  minimal  realization  for 
Gr  is  stable,  although  in  certain  (non-generic)  cases  A„  may  have  uncontroll¬ 
able  or  unobservable  jw-axis  eigenvalues. 

Lemma  1  Product  of  Positive  Semi-Definite  Matrices  is  Similar  to  a  Positive 
Semi-Definite  Matrix 

Proof:  Let  X  and  Y  be  positive  semi-definite.  First  perform  an  orthogonal 

transformation  so  that 

fA,  0  Y\\  Yu 

X  l  0  0  A1>0  dia«onai'  Y  ■*  Yxi  Yn  J 0 

[A1r„Alru| 

By  this  transformation  XY  is  similar  to  q  0‘  ”ow 

A, Fu  A, Ya  [A?  0  Apr,, A?  A fYa  Af»  0 

,0  0  =  0  /Jl  0  0  Jl  0  /]• 

and  it  is  easy  to  find  a  matrix  Z  such  that 

A^uA?  o]  /  Z  kfYt , A?  Apr, 8  /  -Z 
0  oj  =  (o  /][  0  0  j[o  / 

( Z  exists  because  the  columns  of  Apr,,  span  the  columns  of  Apr,z  owing  to 
the  fact  that  Y  is  positive  semi-definite).  The  left  hand  side  of  this  last  equa¬ 
tion  is  positive  semi-definite  and  similar  to  XY.  If  XY* 0  it  is  possible  to  find 
a  matrix  71,  such  that 


TrlABT1  =  A  =  diag|(Ai . X*.  0 . Oi 
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where  X^Xj^  ■  •  •  *t\x>0. 


Q.E.D. 


Now  consider  two  gramians  X  and  Y.  Let  us  suppose  XY* 0,  so  that  Ti 
can  be  chosen  as  above: 


Tr'XYTi  =  A  =  diag 


!(Ai . Xjr.0 . 0) 


where  X^Xjfe  ■  ■  ■  >\K>0.  Under  this  transformation  the  gramians  become 

q  =  Tr'x(Trly .  r  =  tx'ytu 

and  QR-L  Because  Q  and  R  are  symmetric,  RQ-R-h-QR  and  so  Q.R  and  A 
commute.  Both  Q  and  R  must  leave  the  eigenspaces  of  A  invariant  and  so 
are  of  the  form 


Q  ~  diag 
R  ~  diag 


Q  . . ft . 

K& . 


where  ft  is  a  square  matrix  whose  size  equals  the  dimension  of  the  eigen* 
space  of  A  and  EF=0  where  E  and  F  are  square  matrices  the  size  of  the  ker¬ 
nel  of  A  Of  course,  all  the  ft's  are  summetric  so  it  is  possible  to  find  an 
orthogonal  matrix 


W  =  diag 


♦l 


such  that  and  are  diagonal.  Note  that  this  same  IT 

gives  a  diagonal  W'RW  and  leaves  A  alone. 


0.3.2  Inner  Transfer  Functions 


Let  G 


A  B 

~  [c  D  ■ 


Then  G  is  inner  if  G*G=I  and  co-inner  if  GGm-I.  Note 


that  G  need  not  be  square.  Inner  and  co-inner  are  dual  notions  and  are  often 
called  all-pass. 

If  G  e  Rg*™ ,  p>m  is  inner  then  any  Cj_  £  is  called  a  comple¬ 

mentary  inner  factor  (CIF)  if  |g  Gj|  is  square  and  inner.  The  dual  notion  of 
complementary  co-inner  factor  is  defined  in  the  obvious  way. 

The  following  lemma  is  useful  in  characterizing  inner  transfer  functions 
in  terms  of  a  realization. 

Lemma  1.  Suppose  5  X=X  £  HR"”1  such  that 

i)  A  X  -  XA  -  C'C  -  0 

ii)  B'X  ^  D'C*  0 

Then  GmG  =  D'D. 

Proof:  Suppose  that  i)  and  ii)  hold.  Then  conjugating  the  state  of 


A  0 

B 

-C'C  -A' 

-CD 

,  D'C  B' 

D'D 

I  0 

by  [  on  the  left  and 


7  0'1  7  0 

-X  l\  ~  [AT  / 


on  the  right  yields 


A  OB 

G*G  =  -(A'X+XA  +  C'C)  -A’  -(XB+CD)  . 
B'X+D'C  W  D'D 


Now.  applying  i)  and  ii)  yields 

A  0  B 
GmG  =  0  -A'  0 

0  B'  D-D 
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=  D'D 


By  duality,  we  have  the  following 

Lemma  1*  Suppose  3  Y-Y  e  JR"*"  such  that 

i)  AY  YAr  ♦  BB'  =  0 

ii)  CY  *  DB'  =  0 

Then  GG*-DD'. 


These  two  lemmas  immediately  lead  to  one  characterization  of  inner 
matrices  in  terms  of  their  state  space  representation.  Simply  add  the  condi¬ 
tion  that  D'D-1  ( DD'-I )  to  lemma  1  (1‘)  to  get  G*G=I  ( GG*-I ).  Further¬ 
more,  by  adding  a  few  additional  assumptions,  the  conditions  in  the  lemmas 
become  necessary  as  well  as  sufficient.  This  leads  to  the  following  complete 
characterization  of  stable  inner  transfer  functions  in  terms  of  a  minimal 
realization. 


Suppose  G- 


is  stable  and  minimal. 


Then  the  gramians  X  and  Y 


satisfying 


AX  +  XA  +  C'C  =  0 

(2) 

AY  -  YA'  *  BB'  =  0 

(3) 

exist  and  are  unique. 


Corollary  1  G  is  inner  iff 

i)  B'X  t  D'C  -  0 

ii)  D'D  =  I 


Corollary  1*  G  is  co-inner  ill 

i)  CY  +  DB'  =  0 

ii)  DD'  =  / 

Proof  Sufficiency  of  i)  and  ii)  follows  immediately  from  the  lemmas.  For 
necessity,  suppose  G*G=I.  From  1)  and  2)  this  implies  that 


Since  ( A,B )  is  controllable,  (4)  implies  that  B'X+D'C= 0.  The  co-inner 
case  follows  by  duality. 

This  characterization  of  inner  transfer  functions  is  from  [AnV]  and  plays 


an  important  role  in  the  synthesis  theory.  It  allows  the  construction  of  inner 
transfer  functions  by  solving  algebraic  equations. 


0.4  Linear  Fractional  Transformations 


Suppose  P  =  pi  e  .  Ac  B,.Ke  !?**•  We 


(1) 


will  adopt  the  notation 

Tt(P.K)  i  P n  +  PuKV-PzzlO-'Pzx 

and 

F„(P,A)  I  PB  +  PzlL(I-Pn*)-lPiz  (2) 


The  linear  fractional  transformations  (LFT)  are  illustrated  in  Figure  1. 
The  l  denotes  that  the  second  argument  is  fed  back  in  the  lower  block,  and 
the  u  denotes  feedback  in  the  upper  block. 


An  important  property  of  LFTs  is  that  any  interconnection  of  LFTs  is 

Uu  J ial 


again  an  LFT.  Suppose  J  =  j 


22 


.  Then 


Ti{P.Ti(J.Q))  =  Yl(T,Q) 
Ftt(/.Ftt(F.A))  =  F„(7\A) 


(3) 

W 


where 


T  = 


7*11  7”l2 
Tzi  ^22 : 


Pn  “  Pi2^iiU~Psz^u)~lPzi  Piz{f~JuP&)~lJu 
j  tlU~P  ZiJ  n)~lP 21  *^22  +  Jz\PztU"~J \\P  Zl)~X  J \V. 


(5) 


Equations  (3)  and  (4)  are  illustrated  in  Figure  2.  Note  that  if  Ft (J,Q)  is  a 
parametrization  of  a  controller,  Fi(T,Q)  is  affine  if  and  only  if  722=0.  This 
type  of  controller  parametrization  will  play  an  important  role  in  the  syn¬ 
thesis  theory. 
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1.  General  Framework 
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1.  Introduction 
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3.  SSV  Analysis  of  Systems 


4.  A  Glimpse  at  Synthesis 


1.1  Introduction 


This  part  of  the  notes  briefly  reviews  recent  results  on  the  problem  of 
analyzing  the  performance  and  robustness  properties  of  systems 
([Doy2],[DWS]).  The  main  goal  is  to  motivate  from  a  control  theory  point  of 
view  the  synthesis  problems  considered  in  part  2  of  these  notes.  A  secondary 
goal  is  to  describe  a  general  approach  to  control  problems  which  is  intended 
to  provide  the  foundation  for  a  new  paradigm  for  control  theory  broader  in 
scope  and  content  than  that  provided  by  Classical  or  Modern  Control  Theory. 
An  important  aspect  of  this  new  paradigm  is  the  treatment  it  gives  to  model 
uncertainty. 

Modem  Control  Theory,  the  dominant  paradigm  for  the  past  20  years, 
has  its  basis  in  Stochastic  Optimal  Control  and  Estimation  Theory  [LQG].  This 
theory  essentially  restricts  model  uncertainty  to  additive  noise.  The  theory 
provides  a  methodology  for  analyzing  the  impact  of  noise  on  system  perfor¬ 
mance  and  synthesizing  to  reduce  that  impact 

The  inadequacies  of  this  view  of  uncertainty  became  widely  accepted  in 
the  late  1970' s,  as  robustness  to  plant  uncertainty  became  a  major  theme  in 
the  Modem  Control  Theory  community  [LMC].  Ironically,  this  involved  a 
renewed  interest  in  the  Classical  Control  paradigm  ([Bod],[Kor]),  which 
Modem  Control  displaced  within  the  theoretical  community  (if  not  among 
practicing  engineers).  This  new  direction  provided  useful  design  tools,  includ¬ 
ing  Singular  Value  Analysis  and  Multivariable  Loop  Shaping 
([DStl],[Doyl],[DSt2]). 

While  providing  an  important  perspective,  as  well  as  practical  tech¬ 
niques.  the  methods  based  on  singular  values  still  require  rather  restrictive 
assumptions  about  uncertainty.  In  particular,  plant  uncertainty  must 


essentially  be  modelled  as  a  single  "unstructured  perturbation." 

The  Structured  Singular  Value  (SSV),  /x,  was  developed  several  years  ago 
to  correct  this  deficiency  in  singular  values  ([Doy2],[DWS]).  In  the  context  of 
the  general  framework  discussed  in  this  memo,  the  SSV  provides  a  very 
powerful  mathematical  tool  for  the  analysis  of  complex  systems.  Indeed,  we 
believe  that  this  framework  together  with  the  SSV  and  the  synthesis  tech¬ 
niques  discussed  later,  has  the  potential  to  form  the  basis  for  a  new  para¬ 
digm  for  control  theory. 

The  remainder  of  this  part  of  the  notes  describes  the  general  framework 
for  control  system  analysis  and  synthesis  which  includes  all  the  viewpoints 
discussed  as  special  cases.  In  particular,  the  assumptions  about  uncertainty 
required  by  each  methodology  are  compared.  In  this  context,  the  words 
analysis  and  synthesis  have  specific  meanings. 

Analysis  is  used  to  describe  the  process  of  determining  whether  a  given 
system  has  the  desired  characteristics.  In  general,  this  may  range  from  the 
use  of  mathematical  tools  to  simulation  to  experimentation,  although 
analysis  is  typically  applied  primarily  to  describe  the  former.  Synthesis,  on 
the  other  hand,  is  the  process  of  finding  a  particular  system  component  to 
achieve  desired  characteristics,  which  are  typically  expressed  in  terms  of 
some  analysis  tools.  Analysis  and  synthesis  are  just  two  aspects  of  the  more 
general  problem  of  engineering  design. 

The  discussion  which  follows  first  considers  analysis,  then  briefly 
touches  on  synthesis.  The  next  part  on  Synthesis  Theory  will  take  up  that 
question  in  more  detail. 


1.2  Analysis  Framework  and  Background 

1.2. 1  General  Framework 

Various  modelling  assumptions  will  be  considered  and  the  impact  of 
these  assumptions  on  analysis  and  synthesis  methods  will  be  explored.  Con¬ 
sider  the  diagram  in  Figure  1.  This  is  the  general  framework  to  be  con¬ 
sidered.  Models  of  this  form  are  typically  constructed  from  components 
which  also  have  this  form.  The  nominal  model  provides  the  basic  intercon¬ 
nection  structure  between  the  signals,  perturbations  and  controller,  as 
shown.  It  has  three  inputs  and  outputs,  each  consisting  of  a  vector  of  signals. 

As  typical  examples,  consider  the  following  filtering  and  control  prob¬ 
lems.  First,  a  simple  filtering  problem  is  given  in  the  diagram  in  Figure  2. 
This  may  be  rearranged  as  shown  in  Figure  3  to  fit  the  general  framework.  In 
order  to  simplify  the  diagram,  no  perturbation  was  included.  A  typical  con¬ 
trol  problem  might  look  like  the  diagram  in  Figure  4  where  again,  for  simpli¬ 
city,  no  perturbations  are  included.  This  too  can  be  rearranged  to  fit  the 
general  framework,  although  the  diagram  is  complicated. 

Any  system  may  be  rearranged  to  fit  the  form  of  this  general  frame¬ 
work.  Although  the  interconnection  structure  can  become  quite  complicated 
for  complex  systems,  many  software  packages  are  available  which  could  be 
used  to  generate  the  interconnection  structure  from  system  components. 

Note  that  uncertainty  may  be  modelled  in  two  ways,  either  as  external 
inputs  or  as  perturbations  to  the  nominal  model.  The  performance  of  a  sys¬ 
tem  is  measured  in  terms  of  the  behavior  of  the  outputs  or  errors.  The 
assumptions  which  characterize  the  uncertainty,  performance  and  nominal 
model  determine  the  analysis  techniques  which  must  be  used. 


The  most  fundamental  assumption  that  is  made  throughout  is  that  the 
nominal  model  is  a  finite  dimensional  ordinary  differential  equation  and  is 
linear  and  time  invariant  (LTIODE).  The  uncertain  inputs  are  assumed  to  be 
either  filtered  white  noise  or  weighted  Lj,  signals.  Performance  is  measured 
as  weighted  output  variance  or  weighted  output  Lp  norm.  The  perturbations 
are  assumed  to  be  themselves  LTIODE’ s  which  are  norm- bounded  as  input- 
output  operators.  Various  combinations  of  these  assumptions  form  the  basis 
for  all  the  standard  linear  systems  analysis  tools. 

Given  that  the  nominal  model  is  an  LTIODE,  the  interconnection  system 
has  the  form 


and  the  total  system  is  a  linear  fractional  transformation  on  the  perturbation 
and  the  controller  given  by 

e  =  T%(Tl'P.K).6)u 

=  TMP.V.mu  (2) 

Since  the  focus  of  the  current  discussion  is  on  analysis  methods,  the 
controller  may  be  viewed  as  just  another  system  component  and  absorbed 
into  the  interconnection  structure.  Thus  the  analysis  framework  reduces  to 
the  diagram  ir  Figure  5  where 

e  =  Fu(F.A)u 

=  [p«  +  « 


P\\  P  1a  ^13 
P  -  Pz  1  Pzz  Pz 3 

Pzi  Psz  -P33 


(3) 


Note  that  the  P's  in  (2)  and  (3)  are  not  necessarily  the  same.  Table  1 
and  the  discussion  which  follows  summarize  the  various  assumptions  and 
resulting  analysis  and  synthesis  tools.  In  each  case,  stability  of  the  nominal 
must  be  evaluated.  Since  P  is  assumed  to  have  the.  state-space  representa¬ 
tion 


this  may  be  done  by  checking  that  all  eigenvalues  of  A  lie  in  the  open  lbp. 
There  are  alternatives  to  this  approach  but.  for  simplicity,  it  will  be  assumed 
that  the  nominal  plant,  with  controller  is  closed  loop  stable  in  the  sense  that 
all  eigenvalues  of  A  are  in  the  open  lhp. 

Given  nominal  stability,  the  entries  in  the  table  may  be  interpreted  as 
filling  in  the  following  general  performance/robustness  theorem; 


Consider  the  system  in  Figure  2.  Stability  and  performance  analysis  of 
this  system  requires  a  new  matrix  function,  the  structured  singular  value 
(SSV).  denoted  by  al  Before  proceeding  with  Case  3,  a  digression  to  discuss  /a 
will  be  taken.  For  details,  see  [Doy2]. 


1.3  Structured  Singular  Values 


1.3.1  Introduction 

This  chapter  considers  the  problem  of  stability  with  structured  uncer¬ 
tainty  and  of  performance  in  the  presence  of  structured  uncertainty.  Typi¬ 
cally.  uncertainty  is  present  throughout  a  system.  Suppose  that  a  system  is 
built  from  components  which  are  themselves  uncertain  and  that  component 
uncertainty  is  modelled  as  norm-bounded  perturbations.  This  situaton  can 
be  rearranged  to  fit  the  general  framework  but  the  perturbation  for  the  total 
system  has  structure.  This  can  be  seen  schematically  in  Figure  1. 

Note  that  the  interconnection  model  P  can  always  be  chosen  so  that  A  is 

i >  ti 

block  diagonal,  and  by  absorbing  any  weights,  A  i.<l.  The  results  of  Case 
2b  can  be  applied  in  two  ways: 

1)  Pn'  _S1  implies  stability,  but  not  conversely.  This  can  be  arbitrarily 
conservative,  in  that  stable  systems  can  have  arbitrarily  large  j  Pn  jj  . 

2)  Test  for  each  A,  individually.  This  can  be  arbitrarily  optimistic  because 
it  ignores  interaction  between  the  Ai. 

The  difference  between  the  bounds  obtained  in  1)  and  2)  can  be  arbi¬ 
trarily  far  apart.  Only  when  they  are  close  can  conclusions  be  made  about 
the  general  case  with  structured  uncertainty. 

These  two  limitations  of  Case  2  (and  1)  have  motivated  much  of  the 
research  described  in  these  notes.  The  result  is  a  new  paradigm  described  in 
Case  3.  The  problem  in  Case  3  involves  exactly  that  of  structured  uncer¬ 
tainty. 


inputs  when  A  =  0  but  response  when  A^O  is  not  known.  Only  crude  bounds 
can  be  obtained  with  the  methods  of  Case  2. 

An  additional  limitation  of  Case  2b  is  that  all  plant  uncertainty  must  be 
modelled  as  a  single  norm-bounded  perturbation.  Typically,  uncertainty  is 
present  throughout  a  system.  Suppose  that  a  system  is  built  from  com¬ 
ponents  which  are  themselves  uncertain  and  that  component  uncertainty  is 
modelled  as  norm-bounded  perturbations.  This  situation  can  be  rearranged 
to  fit  the  general  framework  but  the  perturbation  for  the  total  system  has 
structure.  The  problem  of  structured  uncertainty  is  taken  up  in  the  next 
chapter. 


feedback  problems  where  both  types  of  uncertainty  have  significant  impacts 
on  system  performance.  Case  2  has  attracted  a  great  deal  of  research 
interest  recently,  and  is  currently  a  popular  new  paradigm  in  the  multivari¬ 
able  control  community.  Although  implicit  in  the  methods  of  classical  con¬ 
trol  ([Bod],[Kor])  and  some  modern  work  (e.g.  Zanies'  conic  sector  theory 
[Zaml]  and,  more  generally,  the  input-output  stability  theory  of  the  1960's 
[DeV]),  the  approach  did  not  gain  wide  attention  until  the  late  '70's. 

The  current  interpretation  is  a  consequence  of  research  done  in  the  late 
'70's.  ([Doyl],[DSt2],[LMC]).  This  interpretation  involves  singular  values  as 
an  analysis  method  and  singular  value  loop  shaping  as  a  synthesis  approach. 
The  so-called  LQG  Loop  Transfer  Recovery  (LQG/LTR)  combines  the  synthesis 
methods  of  Case  1  with  the  analysis  methods  of  Case  2  to  produce  a  hybrid 
synthesis  method.  This  gives  an  ad  hoc  approach  to  Case  2  that  can  be 
effective  for  many  multivariable  problems. 

Another  approach  to  synthesis  for  Case  2  is  the  so  called  Hm  or  Lm/  Hm 
methods  introduced  to  the  control  community  by  Zames  [Zam2]  and  Helton 
[Kel],  and  developed  further  by  many  others  (eg.  [FKZ]).  The  Lm/Hm 
methods  for  Case  2  are  analogous  to  the  L?/  Hz  methods  of  Case  1  with  the 
exception  that  for  Case  2  the  Lm  rather  than  Lz  norm  is  optimized.  The  solu¬ 
tion  to  the  general  Lm/  Hm  problem  will  be  presented  in  the  Synthesis  part  of 
these  notes. 

The  main  objection  to  Case  2  is  the  restrictive  assumptions  about  uncer¬ 
tainty  (recall  this  was  also  the  objection  to  Case  1).  Although  case  2  allows 
both  uncertain  inputs  and  perturbations,  analysis  can  be  performed  for 
either  individually  but  not  both  together.  Thus  a  system  can  be  shown  to 
remain  stable  when  perturbed  and  have  acceptable  response  to  uncertain 


some  additional  technical  difficulties  and  is  not  the  focus  of  these  notes. 


The  following  theorem  treats  internal  stability 

Theorem  2  FU'(P,A)  is  internally  stable  for  all  A  £  BRHm 

Iff  INI.*! 

Note  that  input-output  stability  of  FU(P,A)  is  not  necessarily  the  same  as 
internal  stability.  In  particular,  the  following  statement  is  not  true: 

Not— A— Theorem  |e  j2<»  for  all  |ju||g£l  and  A  £  BRHm 

nr  11*4*1 

Counterexample  Suppose  jjpu  !_>1  but  PgjaO. 

From  now  on  stability  will  mean  internal  stability,  but  be  denoted  by 
j|4<-  in  the  table,  even  though  this  is  definitely  an  abuse  of  notation. 
Note  that  generieally  this  distinction  between  internal  and  i-o  stability  does 
not  exist. 

As  in  Case  1,  it  is  essential  to  allow  weights  on  inputs,  outputs  and  per¬ 
turbations.  As  before,  these  weights  may  be  absorbed  into  the  nominal 
model.  This  allows,  without  the  loss  of  generality,  the  use  of  signals  and  per¬ 
turbations  which  are  in  unweighted  unit  balls.  Thus  implementation  of  the 
analysis  tools  requires  only  a  method  for  constructing  interconnected  sys¬ 
tems  and  a  method  for  evaluating  the  appropriate  norm.  The  former  applies 
to  all  cases,  whereas  the  latter  requires  a  diflerent  norm  in  each  case. 

Note  that  in  Case  2  both  uncertain  Inputs  and  uncertain  plants  can  be 
handled  with  the  same  analysis  tool.  This  approach  is  particularly  useful  for 


1.2.3  L_  Frequency  Domain  Methods 


Case  2  involves  an  Attempt  to  correct  some  of  the  deficiencies  of  Case  1 
by  moving  to  an  unknown  but  bounded  (in  an  Lg  sense)  framework.  This 
allows  both  types  of  plant  uncertainty  to  be  handled  in  a  common  framework, 
albeit  in  a  limited  manner. 

Case  2a  is  an  Lg  version  of  Case  la.  The  input  is  constrained  to  lie  in 
BLg  as  a  time  signal  (unit  ball  in  Lg)  and  the  performance  is  specified  in 
terms  of  the  output's  Lg  norm.  'With  no  perturbation,  the  analysis  test 
involves  simply  the  Lg  induced  operator  norm,  Le.  I.  on  the  transfer  func¬ 
tion  Pgg. 

The  GAT  in  this  case  is 


Theorem  1 


lej'js;  1  for  all  tu1  1 

1 1  i  r  1 1  1 1* 


\Pa\L*  1 


Although  this  theorem  is  a  trivial  restatement  of  the  definition  of  induced 
norm,  it  means  that  the  analysis  test  is  an  exact  characterization  of  the  per¬ 
formance  requirement. 

Case  2b  is  significant  departure  from  the  previous  three.  It  involves 
maintenance  of  stability  in  the  presence  of  perturbations.  The  block 
diagram  for  F *'(.P.A)  is  shown  in  Figure  1.  There  are  many  ways  to  state  the 
GAT  for  this  case,  depending  on  the  desired  notion  of  stability  and  assump¬ 
tions  on  A.  The  distinctions  are  somewhat  subtle,  but  are  important  from  a 
theoretical  point  of  view.  Nevertheless,  they  do  not  significantly  impact  the 
application  of  the  theory. 


The  A’s  are  assumed  to  be  LTIODE's,  so  that  A  e  RHm.  The  assumptions 
A  e  Cor  A  e  CHm.  give  the  same  result.  The  distributed  cue,  A  €  Hm,  causes 


1.2.2.  Stochastic 


Case  la  involves  unit  covariance  white  noise  input  with  output  variance 
as  the  evaluation  criteria.  Since  no  perturbation  is  allowed,  the  problem 
reduces  to  the  diagram  in  Figure  1  and  E(e  Te )  =  |  J P&  1 |*.  Note  that  colored 

noise  or  weighted  variance  could  be  used  as  shown  in  Figure  2.  Ihis  reduces 
to  the  general  case  by  absorbing  the  weights  Vl  and  into  Pa  as 
Pzl  -  WzCW\.  In  practice,  it  is  essential  to  use  weights  to  reflect  spatial  and 
frequency  variations  in  inputs,  perturbations  and  output  specifications,  but 
in  every  case,  these  weights  may  be  absorbed  into  nominal  model. 

In  Case  lb  the  input  is  an  uncertain  delta  function,  which  is  equivalent 
to  uncertain  initial  conditions.  The  performance  specification  is  the 
expected  value  of  the  Lj-norm  of  the  output. 

Case  1  forms  the  foundation  of  Stochastic  Optimal  Control  Theory'.  Case 
la  includes  the  standard  linear  stochastic  filtering  problem  and  Case  lb 
includes  the  standard  linear  quadratic  optimal  control  problem.  These  are 
combined  to  obtain  the  full  LQG  problem,  which  is  again  Case  la.  These 
assumptions  and  resulting  analysis  methods  have  been  the  dominant  para¬ 
digm  in  the  control  community  for  over  20  years. 

The  development  of  this  paradigm  has  stimulated  extensive  research 
efforts  and  been  responsible  for  important  technological  innovation,  particu¬ 
larly  in  the  area  of  estimation.  The  theoretical  contributions  include  a 
deeper  understanding  of  linear  systems  and  improved  computational 
methods  for  complex  systems  through  state-space  techniques.  The  major 
limitation  of  this  theory  is  the  lack  of  formal  treatment  of  uncertainty  in  the 
plant  itself.  By  allowing  only  additive  noise  for  uncertainty,  the  stochastic 
theory  ignored  this  important  practical  issue.  Plant  uncertainty  is  particu- 


1.3.2  SSV lor  Constant  Matrices 

The  problem  is  to  test  for  def  (/-/«/ A)*0  for  sets  of  A.  Two  standard 
results  are 

1)  def  (/-.&A)*0  V  A  £  A  ja(A)<l 
iff  5(Af)  st  1 

2)  det(I-M£>)*0  v  A  £  ■  X/  j  X  E  C,  jxjci 

iff  p(M )  £  1  where  p(M)  =  max  jXi(Af)  j 

As  a  generalization,  consider  a  function  y  with  the  properties  that 
y(oJA)=-  ja  \y(M)  and 

3)  def(/-AfA)*0  V  A  £  ding  (Aj.Aj,  .  .  .  ,  A*)  j  ff(Ai)  <  l| 
iff  y(l  *1 

Obviously,  y  is  a  function  of  M  which  depends  on  the  structure  of  A  . 

To  be  precise,  a  multi-index  could  be  constructed  which  would  specify  the 

'  • 

structure  of  A  and  y  would  depend  on  that  index.  For  this  informal  dis¬ 
cussion,  just  keep  in  mind  this  fact  and  assume  that  a  structure  is  specified. 
Clearly  3  and  p  are  special  cases  of  y  for  particular  structures  as  indicated 
above.  Furthermore,  for  any  structure 

p{H)  sS  y(M)  s!  (4) 

Given  these  bounds,  how  important  is  yP.  The  answer  can  be  clearly  seen 


from  the  following  examples: 


6X  0 

Suppose  i=  (j  ^  and  consider 

1)  Af=  o  l  p{M)= 0 

det(I-ML)  =  l  so  p(/«f)=0 
_iz  u 

2)  /<f  =  ^  pM=0  5(^)  =  1 

det{I-Mb>)  =  1  -  so  p(#)=l 

C 

Thus  neither  p  nor  5  provide  useful  bounds  even  in  simple  cases.  The 
only  time  they  do  provide  reliable  bounds  is  when  p*ff.  Thus  better  bounds 
on  p  are  needed  to  pursue  the  problem  in  Case  3. 

For  the  rest  of  the  discussion  fix  a  structure  for  the  fi’s  as 


where  the  sets  SL  and  J2  match  the  structure  of  X  Note  that  the  &  and  U. 
leave  X  invariant  in  the  sense  that 


3 

1)  iej,  U  e  SI  imply  5(A£7)  =  5(£/A)  =  ?(A) 

2)  A  eX  £  ei2  imply  DM)'1  =  A 

From  these  two  properties  and  the  definition  above  expression  for  p,  one 
immediately  obtains 

max  p(MU)  £  p(M)  ^  inf  Tt{DMD~x) 

rJ  eJI  1  Ds.1 1  v 

The  first  important  theorem  about  p  is 
Theorem  1  max  p(MU)  —  p(M) 

This  theorem  expresses  p  in  terms  of  familiar  linear  algebraic  objects. 
Unfortunately,  the  implied  optimization  problem  is  nonconvex  so  it  does  not 
immediately  yield  a  computational  approach.  The  second  important 
theorem  is 

Theorem  2  If  n<:3  p(M)=  inf  TS(DMD~X) 

D  f.1 } 

This  theorem  states  that  if  there  are  3  or  fewer  blocks  (  no  restriction 
on  size),  then  p (M)  is  just  5  of  a  block  diagonal  similarity  of  U.  Furthermore 
is  convex  in  D  so  that  the  inflmum  can  be  found  by  search  over 
n— 1  real  parameters. 

The  theorem  is  not  true  for  n&4,  but  it  is  conjectured  that 
^inf^  ~d(DHD~l)  still  provides  a  reasonably  tight  bound  for  p.  Also,  many 

problems  of  interest  have  3  or  fewer  blocks  so  this  provides  a  reasonable 
computational  scheme. 

Another  important  aspect  of  this  theorem  is  that  p  may  be  viewed  as  9 
plus  scaling.  Thus  the  general  synthesis  methods  recently  developed  to 
optimize  the  Lm  norm  (i.e.  5)  may  be  applied,  via  scalings,  to  optimize  p. 
This  will  be  discussed  more  in  the  syhthesis  section.  Now  back  to  Case  3. 


1.3.3  S5Y Analysis  of  Systems 


Abuse  notation  and  define 


H  L  =  sup  f*(M(ju)). 


Although  |j»|jM  is  not  a  norm,  this  will  be  convenient.  Recall  that  j  j/^{ ^ 
is  a  function  of  IA  which  also  depends  on  the  assumed  structure  of  the  per¬ 
turbations. 

Case  3a  involves  stability  in  the  presence  of  structured  perturbations 
and  the  result  is  analogous  with  Case  2b.  In  fact,  3a  reduces  to  2b  in  the  case 
that  there  is  a  single  block  in  the  perturbation.  Suppose  that  A  e  BRHm  and 
the  A's  have  the  structure  L-daag  (Aj.A2.A3 . A„).  The  GAT  for  Case  3a  is 


I!*  L 13 


Theorem  1  Ftt(/5.A)  is  internally  stable  for  all  structured  A  £  BRHm 

«  KIL  * 1 


Case  3b  puts  everything  together  and  is  really  the  payoff  for  fx  analysis.  The 
problem  is  to  check  that  jje  8<1  is  satisfied  for  all  jjuj  8sSl  and  all  struc¬ 
tured  perturbations.  Recall  that  from  2a  and  2b  that  both  stability  with  a 
single  perturbation  and  performance  with  i2  inputs  involve  the  same  test 
using  |j*|j—  although  on  different  parts  of  the  system.  This  means  that  the 
system  in  Figure  1  has  internal  stability  and  j]e|[jssi  for  all  Jjuj  2s:i  and 
t-diog (Aj.Az,  ...  .An)  €  BRHm  if  and  only  if  the  system  in  Figure  2  has 
internal  stability  for  ail  structured  A  and  all  An+j  e  BRHm.  This  is  exactly 

Case  3a  with  the  structure  L-diog (Aj.Aj . An.An+j).  Using  this  structure 

for  fx  yields  the  following: 


2 


Theorem  2  FU(P,&)  is  internally  stable  and  |e|jg£l  for  all 
llulLsi  and  A=£fia£(Ai.Aa . A„)  e  BRHm 


This  is  a  remarkably  useful  theorem.  It  says  that  ]  [p]  sc  1  implies  not 
only  stability  for  all  structured  perturbations  but  also  that  ||e  |2sC  1  for  all 
114  £  1  and  all  structured  perturbations.  Furthermore.  |  |p|  >  1  implies 


that  there  exists  a  u  with  |  pi  |  jz  £  1  and  a  structured  A  such  that  either 
||ej  2>  1  or  FU(F,A)  is  internally  unstable.  This  is  the  first  general  result 
which  guarantees  performance  for  a  whole  set  of  plants  and  gives  an  exact 
(nonconservative)  analysis  test. 


1.4  A  Glimpse  at  Synthesis 

This  will  be  a  sketchy  outline  of  the  new  synthesis  results.  The  details 
are  somewhat  complicated  and  are  treated  in  Part  2  which  is  devoted  to  the 
synthesis  theory.  At  this  point,  we  simply  want  to  point  out  how  the  analysis 
theory  discussed  in  this  part  leads  naturally  to  certain  synthesis  questions. 

From  the  analysis  results,  we  see  that  each  case  boils  down  to  evaluating 

IhrIL  0=2."  or  M  (1) 

for  some  transfer  function  P^ .  Thus  when  the  controller  is  put  back  into  the 
problem,  it  involves  just  a  simple  linear  fraction  transformation  as  shown  in 
the  diagram  in  Figure  1.  (Note:  the  Py's  here  are  not  the  same  as  the  Pv‘ s 
in  the  previous  sections) 

Each  case  then  leads  to  the  synthesis  problem 

nrni|jFi(P./05|a  for  a=2,<*.  or  ti  (2) 

subject  to  internal  stability  of  the  nominal.  Here 

np.jo^Pu+PuW-Ptgfn-'Pzv 

The  solution  of  this  problem  for  a-2  and  «®  is  the  focus  of  Part  2  on  Syn¬ 
thesis  Theory.  The  solution  presented  there  unifies  the  two  approaches  in  a 
common  synthesis  framework.  The  a  =  2  case  was  already  known  and  the 
results  are  simply  a  new  interpretation.  The  o=°°  case  had  been  solved  only 
for  special  cases  where  Pja  and  P^  are  square.  Also,  the  existing  solutions 
did  not  have  computational  schemes  allowing  their  use  on  even  moderately 
sized  problems.  These  two  limitations,  especially  the  former,  restricted  the 
application  of  the  pioneering  H.  methods  to  fairly  simple  problems,  such  as 
sensitivity  minimization.  The  new  solution  eliminates  these  two  limitations. 


bnfortunately,  this  new  solution  for  the  Hz  and  Hm  suffers  from  the 
same  limitations  imposed  by  restrictive  assumptions  about  uncertainty  as  do 
the  underlying  analysis  methods.  While  the  SSV  is  a  great  improvement  for 
analysis  (Case  3).  synthesis  for  the  a=/i  case  is  not  yet  fully  solved.  Recalling 


that  /x  may  be  obtained  by  scaling  and  applying  U  j.,  a  reasonable  approach 


is  to  "solve" 


by  iteratively  solving  for  K  and  D.  With  either  K  or  D  fixed,  the  global 
optimum  in  the  other  variable  may  be  found  using  the  p  and  Hm  solutions 
described  previously.  Example  designs  have  been  done  and  this  scheme 
seems  to  work  well,  but  global  convergence  is  not  guaranteed.  In  fact,  a 
counterexample  has  been  constructed  where  (3)  reaches  a  local  minimum 
which  is  not  global. 
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Part  2.  Synthesis  Theory 
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2. 1  Introduction 


2.1.1  Overview  of  Synthesis 


From  the  previous  part  of  these  notes  on  analysis,  we  have  seen  that  the 
synthesis  problem  in  each  case  reduces  to  finding  a  controller  K  which 
achieves  internal  stability  and  solves 

min  Ikup./olL  a=2,«,  or  fj.  /. \ 

11  W 

where 

€  ,  Pii  e 


P  = 


l^u  Pit 
Pz\  Pzt 


and 


Fi(P.K)  =  Pn  *  PxZK{I-PaK)-'Pzx 


We  will  restrict  our  attention  for  now  to  the  a-Z  and  •»  cases  of  (l). 
Recall  that  the  a-P  case  of  (1)  can  be  converted  to  the  a-m  case  by  scaling. 
The  approach  of  these  notes  is  to  develop  the  a=2  and  «  cases  in  a  parallel 
manner,  emphasizing  their  common  features. 

We  begin  by  considering  the  special  case  of  (1)  where  all  matrices  are 
constants.  This  is  an  interesting  problem  in  its  own  right  and  manages  to 
capture  the  essential  features  of  the  general  problem.  While  the  a=2  case  is 
quite  straightforward,  the  key  step  in  the  solution  of  (1)  for  a  =  «  was  first 
published  in  1992  by  Davis,  Kahan,  and  Weinberger  in  their  important  paper 
on  norm-preserving  dilations  [DKW],  The  treatment  of  the  constant  case  in 
this  chapter  is  based  on  this  paper. 


The  remainder  of  this  part  of  the  notes  involves  taking  each  step  of  the 
solution  to  (l)  for  the  constant  case  and  generalizing  to  the  case  of  real- 
rational  matrices.  The  difficulty  arises  from  stability/causality  considera¬ 
tions  which  are  not  present  in  the  constant  matrix  case.  If  the  requirement 
for  internal  stability  were  dropped  from  the  optimization  problem  in  (1)  then 
it  would  reduce  immediately  to  the  constant  case  at  each  frequency.  In  gen¬ 
eral,  however,  any  K  obtained  in  this  way  would  not  be  one  such  that  Ft(P,K ) 
is  stable. 


The  role  of  the  stability  requirement  can  be  seen  by  considering  a  spe¬ 
cial  case  of  (1)  where  Pzz  =  0  and  both  Pis  and  P21  are  square  and  inner. 
With  these  assumptions,  Ft(P,K)  =  Pn  -  PizKPzi.  so  internal  stability  is 
equivalent  to  stability  of  K  (  K  £  RHm).  Since  both  a  =  2  and  *  norms  are 
unitary  (Le.  inner)  invariant, 

!jW.JO!L  =  jj*  -  4  "here  R  =  PvfPnPi'-  (2) 

Since  the  RHm  part  of  R  in  (2)  may  be  absorbed  into  K,  we  see  immediately 
that  our  simplified  problem  reduces  to 


where  R  £  RH^~. 


min  II/?  +  k\  L 
jt  tutr**  11  11 


0=2  or  », 


(3) 


For  the  a=2  case  we  have  immediately  that  the  optimal  K  in  (3)  is  K=Q, 
since  Lg  is  a  Hilbert  space  and  Hz  ami  H are  orthogonal  subspaces.  The 
e=»  case  is  a  version  of  the  well-known  matrix  interpolation  problem.  It  is 
the  matrix  generalization  of  the  classical  "best  approximation"  problem  of 
finding  the  nearest  Hm  function  to  a  given  function  in  Lm. 


This  best  approximation  problem  has  a  rich  history.  The  rational  matrix 
version  is  as  follows:  given  R  £  RH^  (i.e.  strictly  proper  with  all  poles  in  the 
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open  right  half  plane)  find  ail  K  e  RHm  (if  any  exist)  such  that 

||*  -  (4) 

More  general  versions  drop  the  rationality  assumption.  There  is  a  vast  litera¬ 
ture  on  interpolation  problems  and  the  related  question  of  best  approxima¬ 
tion.  Some  of  the  early  references  can  be  found  in  the  seminal  paper  on 
"generalized  interpolation"  by  Sarason  [Sar].  Alternative  treatments  include 
those  by  Adamjan,  Arov,  and  Krein  ([AAKl]-[AAK3])  and  Ball  and  Helton 
[BaHj. 

The  most  recent  contribution  to  the  rational  matrix  best  approximation 
problem  is  by  Glover,  who  gives  a  complete  parametrization  for  all  solutions 
to  (4)  in  terms  of  a  state-space  realization  for  R  [Glo].  Furthermore,  a  reali¬ 
zation  of  the  parametrization  can  be  computed  from  the  realization  of  R 
using  standard  real  matrix  operations.  The  main  goal  of  this  part  on  syn¬ 
thesis  is  to  reduce  the  problem  in  (l)  to  that  in  (3).  The  a=2  case  is  then 
trivial,  and  the  a==°  case  can  be  solved  using  Glover's  results. 

An  important  feature  of  the  results  that  follow  that  achieve  the  reduc¬ 
tion  of  (l)  to  (3)  is  that  each  step  can  be  computed  using  standard  real 
matrix  operations  in  terms  of  a  realization  of  P.  This  allows  the  entire  solu¬ 
tion  scheme  to  be  implemented  in  software  in  a  reasonably  straightforward 
manner,  which  is  important  if  the  mathematical  results  of  these  notes  are  to 
be  applied  to  engineering  problems.  The  connection  of  the  theory  with  com¬ 
putation  is  reinforced  throughout  by  the  approach  of  proving  existence  parts 
of  theorems  by  giving  what  amounts  to  an  algorithm  for  computing  a  particu¬ 
lar  solution.  While  this  occasionally  makes  for  rather  tedious  proofs,  the  fact 
that  the  theorem  can  be  implemented  almost  directly  makes  this  approach 
seem  well  worth  it.  It  should  be  noted  that  each  main  theorem  in  this  pu-t 


.  • 
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has  been  implemented  and  experimental  control  systems  have  been 
designed  using  this  software.  The  results  to  date  have  been  most  encourag¬ 
ing. 

To  see  the  importance  of  the  stability  requirement  and  the  resulting 
best  approximation  problem,  note  that  relaxation  of  the  requirement  in  (3) 
that  K  e  RHm  to  simply  requiring  that  K  e  Rj,  means  that  the  minimum  of  0 
in  (3)  is  achieved  by  letting  K  -  -R.  This  is  essentially  what  holds  for  the 
constant  case.  Thus  the  stability  requirement  restricts  the  achievable 
optimal  in  (3).  As  we  shall  see,  the  stability  requirement  has  a  similar 
impact  on  the  general  problem  in  (1). 


c.  l.Z  Ccc.scant  Matrix  Case 


In  this  section,  we  will  consider  a  special  synthesis  problem  where  all 
matrices  are  constants.  The  constant  matrix  case  will  allow  us  to  study  the 
synthesis  problem  in  a  simplified  context,  but  one  which  parallels  the 
rational  case. 

For  constant  matrices,  the  norms  reduce  to 


I  !P  :  !.  =  5(P) 

P  ;*«[wv)]M 

Note  that  these  definitions  are  not  conventional,  but  they  are  convenient  in 
allowing  parallel  development  of  the  constant  and  rational  cases. 


Consider  the  constant  matrix  problem 

min  .  Ti{P.K)  c  =  2.* 

Zee**™*  ' 


(1) 


where 

^  21  *  ZZj  ’ 

and 

Tt(P.fC)  =  Pn-P\zW-PzLK)-xPiv 

Assume  that  P*zP\z  >  0  and  Pz\Pz\.  >  0. 

The  first  step  is  to  make  the  substitution  of  variables 

K  =  Q(!-PzZQY\  Q  =  (P&P*)-*Q{PzlPii)-* 

SO 

Q  =  (PuPaWl-PaV-KPziPfiP- 

Using  the  linear  fractional  representation  notation. 


(2) 


(3) 


2. 1.5  Rational  Matrix:  Generalization 

The  steps  in  the  rational  case  closely  parallel  the  constant  case,  as 
shown  in  Figure  1.  Most  of  the  work  in  the  remaining  chapters  is  devoted  to 
generalizing  these  steps  from  constants  to  rationals.  The  source  of  all  the 
difficulty  in  the  rational  case  comes  from  the  requirement  for  internal  stabil¬ 
ity,  or  equivalently,  causality.  Without  this  the  rational  case  would  reduce  to 
the  constant  case  at  each  frequency,  and  could  be  solved  using  the  results  of 
the  previous  two  sections. 

We  will  now  outline  the  steps  required  to  solve  the  rational  case  and  pre¬ 
view  the  upcoming  chapters  which  complete  the  details. 

1)  Parametrization:  Find  J  so  that  the  substitution  K^F^J.Q)  yields 

F(P.K)  =  Fi(P  ,Fj(/ ,Q)) 

=  Fi(T,Q) 

-  T  TiiQTzi  (10) 

with  the  additional  requirement  that  T  e  Hm  and 

Yi(P,K)  internally  stable  (11) 

i iff  Q  e  Hm. 

This  parametrizes  all  stabilizing  fCs  in  terms  of  a  stable  Q  e  Hm  in  addi¬ 
tion  to  providing  an  affine  parametrization  of  all  stable  Ti(P,K).  This 
so-called  "Youla  parametrization"  [You2]  is  developed  in  Chapter  2  on 
Stabilization. 

A  further  requirement  is  that  T&  and  Tz j  be  inner,  that  is  T,*  F1Z=/  and 
T’ziT'j"  =  /.  Methods  for  obtaining  the  particular  parametrizations  which 


2.1.4  Figure  1 


min  \iFi(P,K)'<  where  F^P.K)  =  Pu  ^  PlzK(!-PaK)-lPzl 

K  .  .  H® 


parametrization 


K=Ft(J,Q) 


min  'Tn  ~  T^QTn  ;  where  u  —  I  T^Tzi  -  I 

Q  :  ia 


unitary  invariance 


!  1*11-9  *21  I! 

min  p  p 

s  Xl2  Kzz  a 


projection 


dilation 


I! 

i|G~9 


n^n 


Kgpt -Ft(j ,  Qopt  ) 


Note  that  Q=-Rn  is  one  solution  to  (6)  and  (8). 

4)  Recovery  of  the  optimal  K:  This  is  obtained  by  simply  computing  K 
from  the  formula  K=Fi{J,Q)  used  in  step  1)  to  parametrize  the  prob- 
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Recall  that  without  loss  of  generality,  we  may  assume  the  TuTfczQ  so 
that  (2)  becomes 


|*n+$j 

*21 


(4) 


3)  Projection  /  Dilation:  At  this  point  the  a=2  and  a=»  cases  differ.  For 
a=2,  the  problem  reduces,  by  projection,  to 


min 

Q 


(5) 


which  has  the  unique  solution  Q--R u- 


The  a=«°  case  must  be  treated  using  the  matrix  dilation  theory  of  the 
previous  section  Recall  that,  in  general,  the  solution  is  not  unique. 
From  Theorem  3.1.  all  solutions  to 


*  7  f°r  7*  |  j. 

are  of  the  form 

Q  =  -Rn  +  F(72/-*2,1*2i)X  (7) 

n  j| 

for  some  JjFjj.s  1.  Corollary  3.1  gave  the  alternative  characterization 
that 

j  *ii+  Q 

\  *21 

if  and  only  if 

!|(*iI+«)(72/-*2,i*2i)-Xj|.  *  1.  (9) 


£y  for  y  > 


*21  I 


(9) 


It  is  this  latter  characterization  which  will  be  used  in  the  rational  case. 


IB 


2. 1.4  Summary  of  Constant  Problem 


The  rational  matrix  problem  in  equation  (1.1)  can  be  solved  in  a  manner 
which  parallels  the  treatment  of  the  constant  ewe  in  the  last  two  sections. 
This  generalization  is  the  focus  of  the  next  three  chapters  on  synthesis.  To 
reinforce  the  similarity  between  the  constant  and  rational  case,  we  will  now 
review  the  key  steps  from  the  previous  two  sections  and  preview  their  gen¬ 
eralizations  to  the  rational  case. 

Consider  the  diagram  in  Figure  1.  This  summarizes  the  steps  in  the  con¬ 
stant  matrix  problem  (2.1).  The  main  steps  are  as  follows: 


1)  Parametrization:  Make  the  substitution  K=Fi(J,Q)  so  that 


W.V  =  Fi(P.Ft(J.Q)) 

=  Fi(T.Q) 

=  Tu  -  Tl2QT2l  (1) 

is  affine.  Additionally,  we  want  T*2Tl2=I  and  T2i  T2i  =/. 


2) 


Unitary  Invariance: 
and  unitary*. 


Find  7j_and  7j_so  that  |FIZ  7jj  and 


7*ei 

?jJ 


are  square 


Pre-  and  post-multiply  by  Fie 


7jj  and 


TZI 

Jli 


to  yield 


R\\+Q  R i2 
R2\  Rzz 


(2) 


where 


(26) 

where  Rz  -  (t2/  -AAm)  ~^Ri 
U  2  =  (t2/  -AAm)~^U. 

To  complete  this,  simply  factor  L’2  to  extract  a  unitary  factor,  and  apply  the 
dual  of  (22)-(26)  to  (26).  Although  the  formulas  get  messy.  (22)  can  be 
solved  in  this  manner. 

In  each  of  these  cases,  Theorem  2,  Theorem  3,  their  corollaries,  and  the 
solution  described  above,  the  general  case  reduces  almost  immediately  to 
application  of  Theorem  1  or  Corollary  1.  Thus,  when  it  is  convenient,  we  will 
consider  (4)  rather  than  (16)  and  (26)  rather  than  (22).  This  will  simplify  the 
discussion  of  the  rational  case  without  introducing  any  loss  of  generality. 

The  restriction  that  y  >  y0  in  Corollary  3  and  (22)-(26)  does  introduce 
some  loss  of  generality.  If  these  alternatives  to  Theorem  3  are  used,  it  is  not 
possible  to  get  all  solutions  for  y  =  y, .  All  solutions  arbitrarily  close  to  the 
optimal  7,  is  the  best  that  can  be  done.  The  reason  for  considering  this  spe¬ 
cial  case  is  that  (20)-(21)  and  (22)-(26)  have  reasonably  straighforward  gen¬ 
eralizations  to  the  rational  matrix  case,  whereas  (17)-(18)  do  not.  A  further 
difficulty  with  the  rational  case  is  that,  unlike  the  constant  case,  it  appears 
that  it  is  not  possible  to  actually  compute  y ,  exactly.  This  makes  our  inabil¬ 
ity  to  obtain  a  direct  generalization  of  Theorem  3  seem  less  critical,  at  least 
with  respect  to  application  of  this  theory.  More  will  be  send  about  this  in  Sec¬ 
tion  5,  but  the  rational  matrix  generalization  will  focus  on  (22)-(2S). 


The  following  corollary  gives  an  alternative  version  of  Theorem  3. 


Corollary  3  For  7  >  7, , 


|  X  B 
I  1C  A 


II 


*  7 


(19) 


iff 


j  |(/-m -*(X+  rAZ)(I-ZmZ)  -H 


*  7 


(20) 


where 


Z  =  (7 2I-AA‘)~*C  (21) 


There  are  many  alternative  characterizations  of  solutions  to  (19), 
although  the  formulas  in  (20)  and  (21)  seem  to  be  the  simplest 


For  the  problem  in  (14),  the  following  equivalences  apply  for  all  7  >  7,  : 

\R*UQV  ||.  a=  7  (22) 

iff 

\\m*7  (23) 

iff 


R V  *  UQ  RV£ 


\Rt-VQ 
where  Rx  -  JtV*  and  A  =  RV£ 


UT 


(72/-A4,)'k 


R,  .  w]  JI.-1 


(  by  Corollary  1'  ) 


(24) 


(25) 
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form  of  (2.4)  for  the  problem 

7»  =  min  ||i?  +  UQV\^m  (14) 

where  U*U  =  /  and  W*  =  / 

Corollary  2 

7.  =  max  {|j^|L  |^Kl|L  <1») 

The  following  theorem  parametrizes  all  solutions  to  (l).  The  proof  is 
omitted  (it  can  be  found  in  [DHK]),  but  is  similar  to  Theorem  2  and  involves 
application  of  Theorem  1  and  1’. 

Theorem  3  Suppose  7  5s  7,.  The  solutions  X  such  that 

NIL- 

are  exactly  those  of  the  form 

X  ~  -YA*Z  +  'y(I—YY*ftW(J—Z*Z)%  (17) 

where  W  is  an  arbitrary  contraction  ( '  jjf!  j«il)  and  Y  and  Z  solve  the  linear 
equations 


B  -  Ytfl-A'A^ 
C  =  tfl-AA'ftZ. 


(18) 


Theorem  2 


7.  =  max  [c*  xjjL,  ^ 


Proof:  Denote  by  7  the  right  hand  side  of  the  equation  (13).  Clearly,  ye>7 

since  compressions  are  norm  decreasing.  That  7,^7  will  be  shown  by  using 
Theorem  1  and  1*. 

From  Theorem  1  we  have  that  B-Y(^I -A*A)*  for  some  Y  such  that 
||Fj|«Sl.  Similarly,  Theorem  1‘  yields  Csffil-AA'ftZ  for  some  Z  with 

kii-*1- 

Let  X  =  -YAmZ.  Then 


5]!  |  !![  -X4*Z  Y(j*I-A*A)* 

A  =  j|  (5?I-AA*)*Z  A 


I  if  •A*  (tfl-A'A^ ||f 


!k^/-A4*)« 


-A*  ffl-A'A^ U  -A  ??!-AA'$  tyl  0 

0*I-AA*)*  A  ][ ffl-A’A)*  A 9  ~  [0  ?1 


Thus  7^7,,  so  ^=7,. 


This  theorem  gives  one  solution  to  (12)  and  an  expression  for  yt.  As  in 
(3),  thei  may  be  more  than  one  solution  to  (12),  although  Theorem  2  only 
exhibits  one.  Theorem  3  considers  the  problem  of  parametrizing  all  solu¬ 
tions.  The  solution  X  =  -YA*Z  is  the  "central"  solution  analogous  to  X  -  0  in 
(3).  The  next  corollary  is  an  alternative  statement  of  Theorem  2  using  the 
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Corollary  1:  For  y>y0 . 


in- 

(6) 

iff 

Xtfl-A'A)-*  |  j„  1. 

(7) 

The  corresponding  dual  results  are 

Theorem  1'  For  v-y  i  y„ 

IMI!-*?  “ II4« 

(s) 

such  thet 

X  =  {ytl-AA'fty 

O) 

• 

Corollary  1’  For  y>y0 

IIHII-*’’ 

(10) 

.  • 

iff 

\tfI~AA0)-*x\\m*  1 

(U) 

Now,  returning  to  the  problem  in  (1),  let 


The  following  theorem,  usually  attributed  to  Parrott  [Par],  will  play  a  central 
role  in  the  synthesis  theory.  The  proof  is  a  straightforward  application  of 


Theorem  1  and  1’. 


2 


in  (3).  it  is  immediate  that  yc  =  xj  j».  The  following  theorem  characterizes  all 
solutions  to  (3). 


Theorem  1:  For  Vy  y9 , 


iff  r  r  With  !M  Ul  such  that 

It  M 

X=Ytfl-A9A)* 

Proof: 


X*X  -  A9 A  <  y2! 


X*X  £  tfI-A*A) 

|  [aTix  1 1  as  ||(tz/-^M)hu  Vu 
X  =  r(y*I-AmA')*  for  some  ||yj|^l 


(4) 

(5) 


This  theorem  implies  that,  in  general,  (3)  has  more  than  one  solution.  This  is 
in  contrast  to  the  a  =  2  case.  The  solution  X  =  0  is  the  central  solution  but 
others  are  possible  unless  A*A  =  y1!.  A  more  restricted  version  of  the 


theorem  is 


2.1.3  Matrix  Dilation  Problems 


Consider  the  optimization  problem 


lx  51  I 

C  A]  |. 


where  X,B,C,A  are  constant  matrices  of  compatible  dimensions.  This  is  a 

be  b] 

restatement  of  (2.7)  for  the  a=“  case.  The  matrix  L-.  J  is  a  dilation  of  its 
submatrices  as  indicated  in  the  following  diagram: 


X  B 

C 

B 

C  A 

4- 

A 

In  this  diagram,  c  stands  for  the  operation  of  compression  and  d  stands 
for  dilation.  Compression  is  always  norm  decreasing;  sometimes  dilation 
can  be  made  to  be  norm  preserving.  Norm  preserving  dilations  are  the  focus 
of  this  section,  which  basically  follows  the  development  in  [DKW]  and  [Pow], 
See  [DKW]  for  additional  references  on  dilation  problems. 

The  simplest  matrix  dilation  problem  occurs  when  solving 


Although  (3)  is  a  much  simplified  version  of  (1),  we  will  see  that  it  contains  all 
the  essential  features  of  the  problem.  Letting  ye  denote  the  minimum  norm 


I 


3 


!i[/?u +Q  R 


m 

<?e 


and  solution  of  (9)  yields  a  solution  of  (1)  by  solving  (2)  for  K. 
The  a  =  2  case  can  be  solved  immediately  from  (9)  since 


Ru+Q  Rig 

!l« 

i  _ 

\Z 

0  Rig 

s 

R  21  Rz  2 

11 = 

\Rn+Q 

I* + 1 

Rgi  Rgs, 

la 

Thus 


Qo pi  -  ~~R a  =  ~T?zTuTzi 


and 


\Rn+Q  R vz 

R  21  R  22 


II 

!!  = 

;  x 


0  Rig 

Rgi  Rzz 


i  a 


(8) 


(9) 


(10) 


The  simplicity  of  the  a=2  case  is  responsible  for  much  of  its  appeal. 
Optimization  in  this  norm  reduces  to  projection  since  Lg  is  a  Hilbert  space. 
This  holds  as  well  for  the  rational  matrix  problem. 

The  a=»  case  is  somewhat  more  complicated  since  Lm  is  not  a  Hilbert 
space  and  the  minimization  in  (9)  cannot  be  solved  by  projection.  For¬ 
tunately,  Lm  arises  as  the  space  of  linear  operators  on  the  Hilbert  space  Lg, 
and  (B)  can  be  treated  as  a  dilation  problem.  The  next  section  focuses  on 
matrix  dilation  problems. 


where 


kn  Jiz)  0  (PxPa)-* 

J  =  Vzi  M  =  taiW*  -(*21*2*1 )  ***2z(*iz*i2)- -M. 

With  this  substitution,  we  have 

*■,(*.*)  =  F,(p.F,(;,i?)) 

=  Pit  +  lfli(PizPlz)  _HJ  C^PziPil  )  '^2lj 

=  7*11  +  TizQTzi  (5) 

where  the  7\,  are  defined  in  the  obvious  way.  This  parametrization  has  con¬ 
verted  the  nonlinear  problem  in  (1)  to  one  affine  in  the  parameter  Q.  Note 
that  TizTiz  =  !  and  Tz\ Tzi  -  /.  Thus  we  can  find  7j_and  7j_such  that  both 
r  i  Ttl 

|^iz  7jj  and  j:  are  square  and  unitary. 


Since  both  a  =  2  and  "  norms  are  unitary  invariant 


where 

Ni 

O  O  T  *  T  T  *  T 9  T  T  .* 

“ll  12  *  12 '  11*21  M2M1*1 

*2i  *22]  =  riTurf, 


Thus  the  problem  in  (l)  reduces  to 
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achieve  this  are  developed  in  Chapter  3  on  Factorization. 


.  -  1  T  21 

2)  Unitary  Invariance:  Find  7j_  and  7j_  so  that  Tl2  7jj  and  are 

square  and  inner  (also  Chapter  3).  Then  pre*  and  post-multiply  by 

i  i*  M* 

\Ta  and  ^  to  yield 


Rn+Q  R\t 

Rzi  Rzz 


where 


f  ~  1 

p-  =  |rj.  ti\ 

IN* 

Again,  to  simplify  the  presentation  suppose  that  TUTI=0  so  that  (12) 
becomes 


Ru+Q 

Rzi 


3)  Projection  /  Dilation:  At  this  point  the  a=2  and  a=»  cases  again  differ. 
For  a=2,  the  problem  reduces,  by  projection,  to 

m.  IK +  Q\l  (i4) 

But  since  Rn  €  Lm  ,  Q=-Rtl  would  not  correspond  to  a  stable  solution. 
The  unia.ue  solution  is  yet  another  projection 

Qopt  =  Pnt(R  n)  (15) 

where  Pfft  denotes  projection  onto  H2.  When  viewed  appropriately,  these 
two  projections  can  be  seen  as  a  single  .projection  onto  a  subspace  of 
Lz(jK 


The  a=“  case  is  again  treated  as  a  dilation  problem.  While  it  is  not  true 


in  general  that 


Ru+Q 

min 

eeff. 

R21 

L  >  *  *  IN 

(18) 


it  is  convenient  to  use  the  characterization  in  Corollary  3.1  and  (4.8)- 
(4.9).  Recall  that 

|*n+9 

i?2i 

iff 

|j(*n+9)(72/“*8V?ai)-,i|j.  *  1  (IB) 


for  7  >7 


(17) 


The  key  to  proceeding  in  the  rational  case  is  to  find  U  E  RHm  such  that 
M~l  E  RHm  and  /./*/</ = (t2! ~Rzi Rz\)-  If  we  use  the  symbol  (7 zI-Rz\RzC^ 
to  denote  this  M,  then  (18)  makes  sense  in  the  rational  case.  Rnding  M 
involves  spectral  factorization  and  is  treated  in  Chapter  3. 


Given  M  e  RHm  with  the  desired  properties,  (18)  reduces  to 


G  +  § 


£ 


1 


(19) 


where  G=RuM~l  cRHm  and  Q=QM~l.  Solving  (19)  for  Q  £  RHm  solves 
(18)  for  Q  E  RH _  Note  that  Q=$M  is  in  RHm  if  3  is,  since  U  £  RHm  by 


construction. 


The  final  step  in  the  rational  case  then  involves  solving  (19)  for  $  E  RHm. 
This  is  the  standard  mathematical  problem  of  approximating  an  Lm 
matrix  by  an  Hm  matrix.  The  approach  which  fits  most  naturally  with  the 
methods  of  these  notes  is  due  to  Glover  [Glo]. 


Glover  gives  an  explicit  parametrization  of  all  solutions  to  (19)  in  terms 
of  a  realization  of  G.  When  combined  with  the  steps  developed  in  these 


notes  which  reduce 


||f1(p.ao]iL  *  y  (20) 

to  (17)  and  then  (,19),  this  yields  a  parametrization  of  all  FT s  which 
achieve  internal  stability  and  satisfy  (20).  The  remainder  of  these  notes 
is  concerned  with  developing  the  mathematics  to  carry  out  the  steps 
outlined  above. 

4)  Recovery  of  the  optimal  K  Just  as  in  the  constant  case  ,Q9pt). 

This  will  stabilize  Ti(P,Kopt)  since  the  parametrization  in  Step  1) 
insured  that  Q  stable  lead  to  internal  stability  of  Ti(P,K)=Fl(P,Ti(J ,Q)). 

Note  that  although  restricting  to  the  special  case  of  (13)  from  (12)  intro¬ 
duces  no  loss  of  generality,  it  may  be  possible  that  (16)  does  not  hold.  Let 


7. 

Kl  - 

(21) 

~7  * 

ML- 

(22) 

Then,  it  is  possible  that  yt  =  y.  In  that  case,  the  algorithm  described  above 
cannot  be  used  to  obtain  all  solutions  such  that 


since  (17)  is  not  equivalent  to  (IS).  It  is  possible  to  find  all  solutions  for  any  7 
arbitrarily  close  to  y0 . 

A  more  fundamental  difficulty  with  the  algorithm  outlined  above  is  that 
it  gives  no  simple  way  of  determining  y0.  so  that  the  algorithm  must  be 
iterated  on  with  a  search  on  7.  It  seems  that  the  best  that  can  be  expected 
is  to  get  arbitrarily  close  to  ye.  In  view  of  this,  the  special  case  when  yt 


would  seem  not  to  be  of  particular  importance. 


The  question  naturally  arises  whether  the  inability  to  compute  yt  is 
intrinsic  or  simply  an  artifice  of  the  chosen  methodology.  lhis  cannot  be 
fully  resolved  in  these  notes,  but  there  is  some  evidence  to  suggest  that  com¬ 
putation  of  7„  is  a  fundamentally  difficult  problem.  In  particular,  it  can  be 
shown  that  ya  is  equal  to  the  norm  of  the  operator 


(24) 


-ftei  I 

This  implies  that  yt  can  be  found  as  the  solution  of  an  eigenvalue  problem, 
but  of  a  matrix  (or  operator)  which  is  infinite  rank.  Thus  again,  from  this 
point  of  view,  y0  can  be  computed  only  to  within  an  arbitrarily  small  accu¬ 
racy.  but  not  exactly.  It  is  clear  that  this  issue  needs  more  research. 


The  remainder  of  these  notes  completes  the  details  of  the  solution  out¬ 
lined  in  steps  l)-4)  above.  This  provides  a  general  solution  to  the  Ha  optimal 
control  problem  modulo  the  above  remarks. 


2.1.5  Figure  1 
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T 
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min 
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2.2  Stabilization 


2.2.1  Introduction 

This  chapter  considers  the  problem  of  finding  a  parametrization  of  all 
controllers  K  C  which  achieve  internal  stability  of 

? i(P.X)  =  Pn  +  PkK(I-Pz>K)-'Pzi 

where 


(1) 


P  = 


P\\  P\z 
Pn  Pzz 


(2) 


e  ,  />*  e 

The  notion  of  internal  stability  for  (l)  is  considered  in  Section  2.  where  the 
appropriate  definitions  and  properties  are  developed.  The  treatment  here  is 
in  the  spirit  of  [DeC]  and  [Per]. 

The  approach  taken  in  the  rest  of  this  chapter  is  to  find  J  so  that  the 
substitution  K-Ft(J,Q)  yields 

Tt(PJO  =  r,(P.Fi(/.Q)) 

=  F,(7\$) 

=  Tu  -  T12QTZi  (3) 

with  the  additional  property  that  T  C  Hm  and 

F{(P,K)  internally  stable  (4) 

iff  Q  £  Hm. 


This  approach  parametrizes  all  stabilizing  f  s  in  terms  of  a  stable 
Q  £  Hm  in  addition  to  providing  an  affine  parametrization  of  all  stable 
Fi{P,K).  This  so-called  "Youla  parametrization"  [You2]  is  developed  in  Sec¬ 
tion  3  on  Parametrization  of  All  Stabilizing  Controllers.  Section  3  puts  the 
algebraic  methods  of  stabilization  as  expounded  by  Desoer  and  co-authors 
[DML]  in  the  context  of  the  linear  fractional  transformations  used  in  these 


There  are  two  main  approaches  to  constructing  stabilizing  controllers  of 
linear  systems,  the  Youla  parametrization  and  state-space  methods  using 
observers  and  state  feedback([KBF],[Lue]).  Each  is  well-known  within  the 
control  theory  community  and  each  has  its  advantages.  As  indicated  above, 
the  Youla  parametrization  yields  a U  stabilizing  controllers  as  well  as  a  con¬ 
venient  affine  parametrization  of  the  closed-loop  system.  Unfortunately,  the 
standard  algebraic  treatment  of  this  subject  gives  no  reliable  scheme  to 
compute  the  coefficients  of  the  parametrization.  Observer-based  stabilizing 
controllers,  on  the  other  hand,  are  easily  constructed  in  terms  of  a  realiza¬ 
tion  of  P  using  a  variety  of  state-space  computation  schemes. 

Section  4  shows  that  these  two  methods  of  stabilization  are  actually 
equivalent  in  a  very  direct  way.  This  allows  for  the  Youla  parametrization  to 
be  constructed  using  the  standard  state-space  computations  of  observer- 
based  stabilization  methods,  providing  explicit  realizations  of  the  desired  J 
in  (3)  in  terms  of  a  realization  of  P.  Section  5  puts  all  of  the  results  of  the 
preceding  sections  together  to  construct  the  desired  affine  parametrization 
of  the  closed  loop  system. 

It  should  be  noted  that  many  of  the  results  on  the  connections  between 
the  algebraic  and  observer-based  stabilization  methods  were  discovered 
independently  by  Nett  and  coauthors  [NeJ].  Also,  many  of  these  results  were 
known  within  the  "systems  over  rings"  community  [Kha].  What  is  clearly  ori¬ 
ginal  to  these  notes  is  the  complete  equivalence  of  the  two  stabilization 
methods  (Section  4,  Theorem  2)  and  the  parametrization  in  terms  of  the  gen¬ 
eral  framework  using  linear  fractional  transformations  (Section  5). 
Nevertheless,  this  material  is  not  the  main  focus  of  these  notes  but  is 


3 


presented  in  detail  so  that  it  may  be  used  in  developing  the  factorization 
methods  of  the  next  chapter. 


proper. 


wgg 


It  is  straightforward  to  show  that  (3)  is  equivalent  to  either  of  the  follow¬ 
ing  two  conditions: 


I  -K(~) 

~Pzzim)  I 

I  -  PaHKH 


is  invertible  ; 
is  invertible  . 


(4) 

(5) 


The  well-posedness  condition  is  simple  to  state  in  terms  of  state-space 
realizations.  Introduce  minimal  realizations  of  P  and  K: 


A 

Bx  B2 

p  = 

Ci 

Du  Du 

Cz 

DZ 1  Dj2 

A 

1  +* 

1 B 

C 

IA 

(6) 

(?) 


Note  that  the  partition  in  (8)  corresponds  to  that  in  (2),  i.e., 


Pa  = 


A 

B}] 

Ci 

Disl 

(8) 


Then  PzzW-Dzz  md  K(<*‘)=5,  so  for  example,  from  (4)  well-posedness  is 
equivalent  to  the  condition  that 


/  -3 

-Dzz  I 


is  invertible. 


(9) 


Well-posedness  will  be  assumed  for  the  rest  of  this  section.  Let  s  and  £ 
denote  the  state  vectors  for  P  and  K  respectively,  and  write  the  system 
equations  in  Figure  1  with  v  set  to  zero  and  e  ignored: 

x  =  Ax  ~  Bzu 
y  -  Cgc  -r  Dzzu 


(10a) 

(10b) 
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-  =  AS  y  By  (10c) 

u  =  tx  +  by.  (lOd) 


The  system  of  Figure  1  is  internally  stable  provided  the  origin  (r ,*)  =  (0,0) 
is  asymptotically  stable.  To  get  a  concrete  characterization  of  internal  sta¬ 
bility,  solve  equations  (10b)  and  (lOd)  for  u  and  y: 


u  _  i  -b 

0 

t 

Z 

y]  =  [-^a  / 

pi 

0 

2 

(Note  that  the  inverse  exists  from  (9)).  Now  substitute  this  into  (10a)  and 
(10c)  to  get 


dfF 


=  A 


where 


*» 

A  0 

Bz  o 

i  -b 

-1 

0  C 

A  = 

0  l 

«r 

0 

-Dzz  I ' 

Cz  0 

Thus  internal  stability  is  equivalent  to  the  condition  that  A  has  all  its  eigen¬ 
values  in  the  open  left  half-plane. 


It  is  routine  to  verify  that  the  above  definition  of  internal  stability 
depends  only  on  P  and  K,  not  specific  realizations  of  them.  The  following 
result  is  standard. 

Lemma  2.  Consider  a  minimal  realization  of  P  as  in  (6).  There  exists  a 
proper  K  achieving  internal  stability  ifl  {A,Bz)  is  stabilizable  and  {CzA)  is 
detectable. 


The  latter  stabilizability  and  detectability  conditions  are  ;sumed 
throughout  the  remainder  of  this  chapter.  Since 


e-uations  (10)  constitute  a  state-space  representation  of  the  system  in  Fig¬ 
ure  2.  Although  the  realization  in  (ll)  is  not  necessarily  minimal,  it  is  stabil- 
izable  and  detectable,  and  these  are  enough  to  yield  the  following  result. 

Lemma  3.  The  system  in  Figure  1  is  internally  stable  iff  the  one  in  Figure  2 
is. 

The  next  section  contains  a  parametrization  of  all  JCs  which  achieve 
internal  stability  for  the  system  in  Figure  2.  To  simplify  notation,  define 

G  :  —  P%2  .  B  B%  ,  C  Cg  ,  D  :=  Dzz  ■ 

Then  (A. B)  is  stabilizabie.  ( C,A )  is  detectable,  and  the  system  under  study  is 
that  in  Figure  3. 

The  above  notion  of  internal  stability  is  defined  in  terms  of  state-space 
realizations  of  G  and  K.  It  is  important  and  useful  to  characterize  internal 
stability  from  an  input/output  point  of  view.  For  this,  consider  the  feedback 
system  in  Figure  4.  This  system  is  described  by: 


Now  it  is  intuitively  clear  that  if  the  system  in  Figure  4  is  internally  stable 
then  for  all  bounded  inputs  the  outputs  (ej.ej)  are  also  bounded.  The 

following  lemma  shows  that  this  idea  lends  to  an  input/output  characteriza¬ 
tion  of  internal  stability. 

Lemma  4.  The  system  in  Figure  4  is  internally  stable  if  and  only  if  ( I-GK ) 
is  invertible  and  the  transfer  matrix 
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I  -K 

-l 

I +K(I—GK)~1G  K(I-GK)~X 

-G  I 

(I-GK)~XG  (I-GK)'1 

(ID 


between  (v,,t/2)  and  (e^ej)  belongs  to  RHm. 


Proof.  As  above  let  (A.B  ,C,D)  and  (A,B  ,C,D)  be  stabilizable  and  detect¬ 
able  realizations  of  G  and  K  respectively.  Then  the  state-space  equations  for 
the  system  in  Figure  4  are: 


lA  01U 

■  £  m 

«!" 

B  0  t 
o  £j[s 

1  \c  ojfr 

D  oil- 

J  =  1°  HF 

o  B\\i 

«i  =  v,  4  j/2,  e2  =  v8  +  y,. 


The  last  two  equations  can  be  rewritten  as 


h 

0  B 

ha 

C  0 

*«. 

+ 

Vg 

Now  suppose  that  this  system  is  internally  stable.  Then  (7)  implies  that 
(I-DB)  -  ( )  is  invertible.  Hence  ( I—GK )  is  invertible.  Further, 


since  the  eigenvalues  of 


0 

B  0 

I  -D 

-1 

0  c 

A\  ~ 

o  B 

-D  I 

C  0 

are  in  the  open  left  half  plane,  it  follows  that  the  transfer  matrix  in  (10)  from 
(vv,vz)  to  (e  i,e2)  is  in  RHm. 

Conversely,  supppose  that  ( I-GK)  is  invertible  and  the  transfer  matrix 
in  (10)  is  in  RHm.  Then,  in  particular,  (I-GK)'1  is  proper  which  implies  that 
( I-GK)( «)  =  (/ -DD)  is  invertible.  Therefore 


is  nonslngular.  Now  routine  transfer  function  calculations  give 


/  -D 

r 

e  t 

fo  cl 

3  0] 

^1 

-D  I 

h 

— 

TT 

0 

/■ - s 

Oi 

1 

Jt- 

0  B\ 

fu2 

Since  the  transfer  matrix  from  (^1,^2)  to  (e^ea)  belongs  to  RHm,  it  follows 
that 


0  c 

B  0 

c  0 

(sI-A)~l 

0  & 

belongs  to  RHm.  Finally,  since  ( A.B.C }  and  (A.£,£)  are  stabilizabie  and 
detectable,  it  follows  that  the  eigenvalues  of  A  are  in  the  open  left  half  plane. 


QED 

We  note  that  to  check  internal  stability  it  is  necessary  (and  sufficient)  to 
check  that  each  of  the  four  transfer  matrices  in  (ll)  are  in  RHm.  it  is  not 
difficult  to  construct  examples  of  G  and  K  such  that  any  three  of  the  four 
transfer  matrices  in  (ll)  are  in  RHm  while  the  fourth  one  is  unstable. 


2.2.3  Parametrization  of  All  Stabilizing  Controllers 


Two  matrices  NM  e  RHm  with  the  same  number  of  columns  are 
right  -coprime  if  the  combined  matrix 

there  exists  X,  Y  €  RHm  such  that  XM  +  YN  =  /.  This  is  often  called  a 
Bezout  or  Diophantine  equation.  An  alternative  definition  is  that  two 
matrices  in  RRm  are  right-coprime  if  every  common  right  divisor  in  RHm  is 
invertible  in  RHm.  This  can  be  shown  to  be  equivalent  to  the  above  definition 
in  terms  of  a  left  inverse,  but  we  will  not  use  this  fact. 

It  is  a  fact  that  every  G  e  Rp  (proper,  real-rational)  has  a  right-coprime 
factorization  G  =  Nli~l  where  N,Ii  €  RHm  are  right  coprime.  Similarly, 
there  exist  left  coprime  factorizations  (lcf),  defined  in  the  obvious  way  by 
duality.  The  proof  of  the  existence  of  such  coprime  factorizations  will  be 
given  in  the  next  section  with  explicit  realizations  for  the  factorizations.  In 
this  section,  we  will  see  how  these  factorizations  can  be  used  to  obtain  a 
parametrization  of  all  stabilizing  controllers. 

Begin  with  refs  and  lcf  a  of  G  and  K  in  Figure  4: 

G  =  NU~l  -  3~lN  (1) 

K  =  UV~'  s  v~lU  (2) 

Lemma  1.  Consider  the  system  in  Figure  4.  The  following  conditions  are 
equivalent: 

1.  The  feedback  system  is  internally  stable. 

2. 

[  V  -U 
1  \-N  U 

Proof:  As  we  saw  in  Lemma  2.3  of  the  last  section,  internal  stability  is 


is  invertible  in  RHm. 


U  V 
N  V 


is  invertible  in  RHm. 


li 

N 


has  a  left  inverse  in  RHm.  That  is. 


equivalent  to  the  condition  that 


[-C  7]  £  RH- 

or,  equivalently, 

[c/]  e  RH~  (3) 


Now 


[/  /d 

f  / 

uvl 

M  - 

\Nirl 

/ 

[ U  v\ 

k* 

0  1 

B 

k  A 

[0  nj 

so  that 


/  K 

-1 

M  0 

Si  V 

G  I 

0  V 

N  V 

Since  the  matrices 


M'M 

are  right-coprime,  (3)  holds  iff 

ij  £  ** 

This  proves  the  equivalence  of  conditions  1  and  2.  The  equivalence  of  1  and  3 
is  proved  similarly. 


QED 


We  shall  see  in  the  next  section  how  to  find  explicit  realizations  for 
N,  Si,  Nf.  3,  Ut,  V„  V,  and  such  that  (1)  holds  and 


\  -jjJ.Uir  U.  [/  o' 
-JV  fit  J[jv  V'\  s  [o  / 


The  above  lemma  says  that 

K,  k  UaV~'  =  V^UB 

then  qualifies  a  particular  controller  achieving  internal  stability.  All  stabiliz¬ 
ing  controllers  can  be  expressed  in  terms  of  Kg  and  a  parameter  Q,  as  shown 
in  the  following: 

Theorem  1.  The  set  of  all  proper  controllers  achieving  internal  stability  is 
parametrized  by  the  formula 

K=K  ^  V;'Q{I+Yr'NQ)-'\7'  (6) 

where  Q  ranges  over  RHm  such  that  (/  -  V‘lNQ)(<*)  is  invertible. 

Proof:  Assume  K  has  the  form  indicated. 


Define 


V  k  U,  +  MQ,  V  k  V.  +  NQ 

V  k  U,  +  QU,  V  k  V.  +  QN 


a  St  i  - 

-N  JS#j[Ar  vj  * 


v.+QN  -(V,+q3)}\u  U'+m 


U  U,  +MQ 
V,+NQ 


■  t  *  it  t  i 

■  t  it  i 


from  (5) 


I  0 
0  / 


( 7 ) 


Thus  K  achieves  internal  stability  by  lemma  1. 

Conversely,  suppose  K  is  proper  and  it  achieves  internal  stability.  Intro¬ 
duce  ref  and  lef  of  K  as  in  (2). 

Then  by  lemma  1 .  Z  A  MV  -  NU  is  invertible  in  RHm.  Define  Q  by  the 
equation 

U,  +  MQ  =  VZ~\  (8) 

so 

Q  =  M~\UZ-X  -  U,)  (9) 

Then 


V,  *  NQ  NM~l( UZ~l  -  U0) 

*  V,  *  3-'f}(UZ-'  -  U.)  from  (1) 

=  m-^mv.  3url) 

a  3~l(I  +  3UZ~1)  *  from  (5) 

=  3-\Z  + 

a  3~l3vz -1 

a  VZ_1  (10) 

Thus, 

a  CfV-» 

a  £/,K,  +  (Jtf  -  U,YT'N)Q(I  *  \7lNQ)-l\7l  (11) 

from  (prelim? ).  Then,  since 

(M  -  U„  KlN)  ~  (M  -  V~'UtN)  a  ?*»(?,#  -  U,N)  =  7"1 


we  have  that 


K  =  UaVa  +  K-^(/  +  V0-'NQ)~lVa-K  (12) 

To  see  that  Q  belongs  to  RHm,  observe  first  from  (9)  and  (10)  that  HQ 
and  NQ  both  do.  Right-coprimeness  of  N  and  M  then  implies  that  Q  €  RHm. 

Finally,  since  V  and  Z  evaluated  at  s=“  are  both  Invertible,  so  is 
Va  4>  NQ,  from  (10).  hence  so  is  /  +  Va~lNQ. 

QED 

Define  the  rational  matrix 


V  ] 

irl  -v?in\ 


and  consider  a  controller  K  given  by  formula  (6).  Then  the  controller  equa- 


r 


)\ 


u  *  TAJ.Qyy 

-  [/t  +  y,lQU  +  K-W‘t7ljy 

is  equivalent  to  the  triple  of  equations 
u  =  K,y  - 

«i  =  K-V  -  *Tl%i 

y,  =  Qut 

The  block  diagram  corresponding  to  this  triple  is  in  Figure  5.  We  con¬ 
clude  that  every  stabilizing  controller  can  be  represented  as  K  -  F(  (J,  Q),  as 
in  Figure  5,  tor  some  parameter  9.  which  is  constrained  only  to  be  stable  and 
proper  and  to  make  K  proper. 

The  next  section.gives  an  explicit  state-space  realization  of  one  choice  of 


the  interconnection  matrix  J. 


2.2.4  Realization  of  S : 


Recall  from  Section  2  that  we  have 


<?  = 


where  ( A,B )  is  stabilizable  and  (C,A)  is  detectable.  To  obtain  a  right- 
coprime  factorization  of  G,  choose  a  matrix  F  such  that  A+BF  is  stable. 
Lemma  1.  A  stabilizing  state  feedback  F  yields  ref  G  =  NJH~l  where 


A+BF 

B 

F 

I 

C+DF 

D 

Proof:  That  G  =  NU~l  follows  from: 


(1) 


A 

i 

A+BF 

B 

C 

[5 

F 

I 

A  BF 

B 

= 

0  A+BF 

B 

C  DF 

D 

A+BF  BF 

B 

S 

0  A 

0 

C+DF  DF 

D 

A+BF 

B 

C+DF 

D 

(cascade  of  two  systems) 


(by  change  of  basis  in  the  state -space) 


(deletion  of  uncontrollable  part) 


=  N. 

That  N  and  M  are  right-coprime  will  follow  from  (3)  below. 


QED 


Note  that  for  any  nonsingular  Z 


2 


is  also  a  realization  of  an  ref  of  G. 

By  duality,  to  get  a  left-coprime  factorization  of  G,  take  H  such  that 
A+HC  is  stable. 

Lemma  1’.  A  stabilizing  output  injection  H  yields  lef  G=M~lN  where 


The  next  step  is  to  specify  17, ,  V9,  ut,  V,  to  satisfy 

V,  U„\  [/ 

-V  M  j[Ar  V.)  =  [o 

The  Idea  behind  the  choice  of  these  matrices  is  as  follows.  Using 

observer  theory.  And  a  controller  K,  achieving  internal  stability.  Perform 

factorizations 

K  =  u,v-'  =  v;'u, 

analogous  to  the  ones  just  performed  on  G.  Then  Lemma  3. 1  implies  that  the 
left-hand  side  of  (3)  must  be  invertible  in  RHm.  We  shall  see  that,  in  fact,  (3) 
is  satisfied. 

The  equations  for  K,  are 

£  =  AS  +  Bu  *  H(CS  +  Du  -y) 
u-FS, 
that  is, 

a;  := 


Define 


and  the  following  theorem  holds. 


Theorems. 

Equation  (3)  is  satisfied. 

Proof:  Verification  of  (3)  is  immediate  using  (7),  (B),  and  the  inversion  for¬ 
mula  for  systems  (prelim). 


A  realization  of  /  is  now  immediate.  Substitution  of  (l),  (4).  (5), 
and  (6)  into  (3. 13)  leads  after  simplification  to 

+BF+HC+HDF \-H  B+HD} 


Let's  recap.  We  began  with  the  (stabiiizable  and  detectable)  realization 


4 


We  chose  F  and  H  so  that  A+BF  and  A+HC  -were  stable.  Define  J  by  (9). 
Then  the  proper  K"  s  achieving  internal  stability  are  precisely  those 
representable  as  in  Figure  5,  where  Q  e  RHm  end 

I  +  DQ{<»)  is  invertible 
(The  last  condition  is  equivalent  to  the  one 

(/  +  V?'NQ)(~)  is  invertible 
which  is  required  as  per  Theorem  1). 

This  representation  result  has  an  interesting  interpretation  :  every 
internal  stabilization  amounts  to  adding  stable  dynamics  to  the  plant  and 
then  stabilizing  the  extended  plant  by  means  of  an  observer.  The  precise 
statement  is  as  follows;  for  simplicity  of  the  formulas,  only  the  case  of 
strictly  proper  G  and  K  is  treated. 

Theorem  2. 


Assume  G  and  K  are  strictly  proper  and  the  system  in  Figure  3  is  inter¬ 
nally  stable.  Then  G  can  be  embedded  in  a  system 


and  At  is  stable,  such  that  K  has  the  form 


where  A,  *  BtF,  and  A,  +  Ht  C,  are  stable. 


5 


Proof.  K  is  representable  as  in  Figure  5  for  some  Q  in  RHm.  For  K  to  be 
strictly  proper,  so  must  Q  be  (see  (3.8)).  Take  a  minimal  realization  of  Q: 


is. 

0 

Since  Q  c  RHm.  A a  is  stable.  Let  x  and  xa  denote  state  vectors  for  J  and  Q 
respectively,  and  write  the  equations  for  the  system  in  Figure  5  : 

x  x  (A  +  BF  +  HC)x  -  Hy  +  By{ 
u  =  Fx  -r  2/j 
•Uj  =  —Cx  +  y 
xa  =  A*xa  +  Ba  uj 
yi  s  C.X, 

These  equations  yield 

*•■(*+  B'F,  -  HtC,)x,  -  H.y 

u  =  F,x9  , 
where 


and  A, .  B, ,  £  are  as  in  (10). 


QED 


2.2.5  Closed-Loop  Transfer  Matrix 

Theorem  1  provides  a  parametrization,  in  terms  of  Q,  of  all  proper  JCs 
which  achieve  internal  stability  in  Figure  1.  The  goal  in  this  section  is  to 
express  the  transfer  matrix  from  v  to  e  in  terms  of  Q. 

A  stabilizing  K  is  representable  as  in  Figure  5.  Substitution  of  the  block 
diagram  in  Figure  5  into  that  in  Figure  1  leads  to  the  one  in  Figure  8.  Elimi¬ 
nation  of  the  signals  u  and  y  leads  to  Figure  7  for  a  suitable  transfer  matrix 
T.  Thus  all  closed-loop  transfer  matrices  are  representable  as  in  Figure  7.  It 
remains  to  give  a  realization  of  T. 

We  must  first  put  back  the  original  notation  which  was  simplified  at  the 
end  of  Section  2.  Let  * 


be  a  minimal  realization  of  P,  and  choose  F  and  H  so  that  A+BzF  and 
A+/fCa  are  stable. 


Lemma  4.  > 

T 

Proof.  With  the  original  notation  we  have  from  (..4.9)  that 


Partition  J  and  T  accordingly: 


The  equations  governing  the  system  in  Figure  6  are 


\Pn 

0 


0 

Jzz 


v 

\y  i 


I 

-Ju 

U 

-PZ2 

I 

y, 

o  /12 
=  \p2l  o 


while  those  for  Figure  7  are 


e 

7*11  7*18 

V 

ui| 

T«  Tzt 

Vi 

Therefore 


Tn 

T 12 

Pn  o 

Pl2 

0 

I  -Jn 

-i 

0  Jl2 

T»i 

7*82 

** 

0  J 22 

0 

Til 

'“Pzi  l 

Pi  l  0  , 

The  strategy  now  is  to  use  the  representations  (l)  and  (3)  in  (4)  to  obtain  (2). 
The  algebra  is  long  and  tedious,  but  straightforward,  so  the  details  are  omit¬ 
ted. 


Partition  T  according  to  (2): 


T  = 


7*11  ^18 
7ji  Tz2 


Explicitly,  we  have 


A+B2F 

-b2f 

Tn  = 

0 

a+hc2 

B \+HD2\ 

(5a) 


state  feedback.  Using  the  notation 


3 


N  '  C  \D 


A+BF  B2 
C+DF  D2 


we  will  use  (l)-(3)  to  get  N  inner  and  A+BF  stable.  From  (2)  we  have  that 
Z=R~^U  where  R~D'D> 0  and  U  is  any  orthogonal  matrix.  Take  U=I.  Equa¬ 
tion  (1)  implies  that 

R~*B'X  -r  R~*D'(C+DF)  =0 

so  solving  for  F  yields 

F  =  -R~\B'X  +  D’C)  (9) 

Then  equation  (3)  yields 

0  =  A'X  *  )Gl  -  Ct 
-  (A+BFYX  -  X(A+BF)  -  ( C+DF)\C+DF ) 

=  <,A-BR-xD-C-BR~lB'X)'X  -r  X(A-BR-'D'C-BR-xB’X) 

+  (C-DR-'B'X-DR-'D’CyiC-DR-'B’X-DR-'D'C) 

=  ( A-BR-'D-cyx  +  X(A-BR~lD'C)  -  XBR~xB'X  +  CDgiC  (10) 

since  D±D£  -  I-DR~XD'.  Thus  A_=Ric  [Ay],  where 


U-B 
=  ~C 


-BR~lD'C  - BR-'-B ’ 

-C'DjPiC  -(A-BR~lD'C)' 


That  ^f=Hic  (Ay)  exists  such  that  A+BF  is  stable  follows  from  Theorem 
2. 1  as  follows.  Let 


if  cc 

CD 

(sI-A)-'B 

1[. D'C 

D'D 

I 

That  T  =  GmG  >  0  is  true  by  assumption.  To  satisfy  Theorem  2.1  we  must 
have  (A  ,  B)  stabilizable  and  (P  ,  A)  detectable,  but  this  is  immediate  since 


2 


and  Z  can  be  any  nonsinguiar  matrix.  To  obtain  a  ref  with  N  inner,  we  sim¬ 
ply  need  to  use  equations  (l)-(4)  to  solve  for  F  and  Z.  This  yields  the  follow¬ 
ing  theorem: 


Theorem  1 : 


Assume  pim,  Then,  there  exists  a  ref  G  =  NM~X  with  N  inner  if  and 
only  if  G*G  >  0  on  the  j  i>-axis,  including  at  This  factorization  is  unique  up 
to  a  constant  unitary  multiple. 


A  particular  realization  for  the  factorization  is 


A+BF 

BR~ * 

s 

F 

R-'* 

C+DF 

DR- * 

(5) 


where 


R = DD  >  0 

F  -  -R-\B'X-D'C) 
and 


X  =  Kc 


A- 


—BR~lD'C 

CDiPlC 


- BR~lB 1 
-( A-BR-'D’Cy 


%  0 


(6) 

(7) 


[Proof] : 

(only  if)  : 

Suppose  G  -  NU~X  is  a  ref  and  N*N-l.  Then 
GmG  =  =  (ST')**!-1  >  0  on  the  ;waxi3  since  U  e  RH„ 

(if): 

The  if  part  will  be  proven  by  showing  that  (l)-(4)  lead  directly  to  the 
above  realization  of  the  ref  of  G  with  inner  numerator.  That  G  =  NM~X  is  an 
ref  follows  immediately  from  (4)  once  it  is  established  that  F  is  a  stabilizing 


2.3.4  Inner-Outer  and  Spectral  Factorization : 


In  this  section,  the  special  form  of  coprime  factorizations  required  to 
reduce  the  general  Ha  optimal  control  problem  to  a  best  approximation 
problem  will  be  developed.  In  particular,  explicit  realizations  are  given  for 
coprime  factorizations  G  -  NM~l  with  inner  numerator  N  (Theorem  1)  and 
inner  denominator  U  (Theorem  3);  and  for  the  complementary  inner  factor 
which  completes  the  inner  numerator  to  make  |/V  Njj  square  and  inner 

(Theorem  2).  The  theorems  will  be  stated  for  right  coprime  factorizations 
(ref)  with  the  duals  for  Lcf's  following  just  as  for  the  general  case  of 


coprime  factorization  developed  earlier. 


For  the  following  theorems,  it  is  assumed  that  G  -  e  and 

the  realization  is  minimal.  We  will  denote  by  R^(R> 0)  the  symmetric 


matrix  such  that  J&R&  =  R  and  use  "D£  for  any  orthogonal  complement  of 
D  so  that  \dR~*  Z?j]  (with  R-D'D)  is  square  and  orthogonal. 


Recall  from  Corollary  0.3.3. 1  that  N  = 


is  inner  if  and  only  if 


i)  B'X  +  B'd  =  o 

(i) 

u)  B'B  =  / 

(2) 

where  the  observability  gramian  X  solves 

ax  +  x2  +  d’d  =o 

(3) 

From  Lemma  2.4.1  a  stabilizing  state  feedback  F  yields  ref  G=NM~X 
where 


3 


Remark : 

The  unique  stabilizing  solution  of  Theorem  1  will  be  denoted  by  Ric  (%). 
Note  that  this  theorem  is  more  general  than  Theorem  2.1  from  the  previous 
section  since  no  detectability  assumptions  are  made.  The  following  theorem 
will  play  an  important  role  in  the  next  section  in  obtaining  complementary 
inner  factors. 

Theorem  2 : 

If  Q  -  HrH  a  0  in  (ARE)  mid  X  is  its  solution,  then  Ker(X)  c  Ker {H). 


I 


2 


I  0 

E  -W 

I  0 

-X  I 

-Q  -E7 

X  l 

E-WX  -V 

~  \-(ETX+XE-XWX  +  Q)  -{ E-WX)7 

\e-wx  -W 
~  [  0  -{E-WX)T  • 

This  puts  Afj  in  block  upper  triangular  form  and  clearly  exhibits  a  particular 
partitioning  of  the  eigenvalues  of  An  with  respect  to  the  imaginary  axis.  For 
example,  if  E-WX  has  all  its  eigenvalues  in  C_,  then  -(E-WX)7  has  all  its 
poles  in  C+.  Thus,  the  solution  of  ARE  which  stabilizes  E-WX  yield  a  decom¬ 
position  of  Ay  into  stable  and  unstable  parts. 


This  section  will  explore  the  conditions  under  which  the  desired  solution 
of  ARE  exists.  There  is  a  considerable  literature  addressing  the  theory  of 
ARE  (eg.  [And],  [Cop],  [Kuc],  [Mat],  [Mol],  [Pot],  [Wll]),  and  it  is  not  the  pur¬ 
pose  of  these  notes  to  give  a  detailed  treatment  of  this  subject.  We  will  sim¬ 
ply  review  the  results  which  are  relevant  to  the  factorization  theorems  in  this 
report 


Now,  we  are  going  to  state  the  main  theorem  of  this  section  which  gives 
the  necessary  and  sufficient  conditions  for  the  existence  of  a  unique  stabiliz¬ 
ing  solution  of  (ARE).  Without  loss  of  generality,  we  will  assume  that 
W  a  GGr. 


Theorem  1 : 

The  stabilizability  of  ( E,G )  and  Rs[\(A#)]  **  0  (V  i  =  1,2, . 2 n)  is 

necessary  as  well  as  sufficient  for  the  existence  of  a  unique  stabilizing  solu¬ 
tion  of  (ARE). 


2.3.3  Solution  of  the  Algebraic  Riccati  Equation : 


P 

L 

LV 

tv 

u’.* 

i 

\ 


Consider  once  again  the  Algebraic  Riccati  Equation, 

EtX  +  XE  -  XVX  +  Q  =  0  (ARE) 

where 

E.W.Q  C  nr*",  W  =  WT  i  0  and  Q  =  Qr 
with  the  associated  Hamiltonian  matrix 


to 

r 


n 

> 

i  - 


A*  = 


E  -W 
-Q  -Er 


(Hamiltonian) 


Our  main  interest  is  to  find  the  unique  real  symmetric  stabilizing  solution 
such  that  the  matrix  (E  -  WX)  is  asymptotically  stable.  For  simplicity  we 
will  use  "solution"  of  the  ARE  to  mean  a  real  symmetric  one.  The  ARE  con¬ 
sidered  here  is  more  general  than  the  ARE  which  arises  in  linear  quadratic 
optimal  control  and  Kalman-Bucy  filtering  theory  in  that  there  is  no  assump¬ 
tion  on  the  definiteness  of  the  matrix  Q. 

An  important  property  of  the  Hamiltonian  matrix  Ag  is  that  the  distribu¬ 
tion  of  its  eigenvalues  (denoted  as  A (A#))  is  symmetric  with  respect  to  both 
the  real  and  Imaginary  axes,  Le.,  if  X  e  A (Ag)  with  multiplicity  fc.  so  is 
X,  —A,  and  -X.  Therefore,  A  can  be  partitioned  as  A}  and  Ag  so  that 
X  €  Aj  with  multiplicity  k  implies  that  X  c  A*  and  — X  ,-X  e  Ag  all  with 
the  same  multiplicity. 

One  connection  between  the  ARE  and  Ag  can  be  seen  by  assuming  that  X 

i  i 


/  0 

is  a  solution  to  ARE  and  conjugating  Ag  by  j 


to  yield 


then  from  (4)  and  (5) 

0‘6>./  -  A)Xi  =  0  (7) 

(jo, I  *  A‘)xg  =  -Px i  (8) 

Since  (7)  implies  x*(juoI+A')-0,  from  (8)  we  have  r •ft,=0.  This  implies, 
along  with  (7)  that  ( P,A )  is  not  detectable.  Hence  £x,  *0.  Now  Lemma 
0.2.3.8  implies  that  there  exists  u,  #0  such  that  V(j  o)u,  =0.  This  contradicts 
the  hypothesis  that  T(;o)>0.  Hence  (a)  -*  (c). 

(b)  -*  (a)  Suppose  z.X-X  such  that  E-BR~xB‘X  -  A-BR~X(S'  +  B'X)  is 
stable.  Let  F--R~X(S’  -rB’X)  and 


It  is  easily  verified  by  use  of  the  Riccati  equation  for  X  and  routine  algebra 


that  f  =  Li*R!A  so 


I* ”*(*)  =  U-\s)R-x(U'(- s))’1 


U~x  * 


\+BF  B 
F  I 


So  U~x  €  RHm.  Thus  e  RLm  and  for  all  Qasw*- 

r'O'u)  *  U-x(jo)R(U‘(-jo))-x  >  0 
Hence  V(jo)  >  0  and  (b)  -*  (a). 


(c)  *+  (b)  This  is  part  of  Theorem  3. 1  in  the  next  section. 


2.3.2  Riccati  Equations  and  Factorizations 


Consider  the  Algebraic  Riccati  Equation, 

E'X  +  XE  -  XWX  +  Q  =  0  (ARE) 

where 

E,  If,  Q  €  B?*xn.  W  =  r  %  0  and  Q  =  Q' 
with  the  associated  Hamiltonian  matrix 

Afj  =  fg  _g,  (Hamiltonian) 

The  following  theorem  and  corollary  characterizes  the  relationship 
between  spectral  factorization.  Riccati  equations,  and  decomposition  of  Ham¬ 
iltonians. 

Theorem  1  Let  A,B,P,S,R  be  matrices  of  compatible  dimensions  such  that 
P-P' ,  R=R‘>0.  with  (A.B)  stabiiizable  and  (P,A)  detectable.  Then  the  fol¬ 
lowing  statements  are  equivalent. 

a)  The  parahermitian  rational  matrix 

w  -  xr-  4  fr* 

satisfies 

V(Ju)  >  0  for  all  OsuS« 

b)  For  E~A-BR~lS' ,  W=BR~lB‘  and  Q=P—SR~'iS\  there  exists  a  unique 
real  X -X'  such  that 

E'X  +  XE  -  XWX  +  Q  =  0 


and  E-BR~lB'X  is  stable. 


The  key  idea  behind  the  factorizations  in  this  chapter  is  the  connection 
between  inner  functions,  Rice  at  i  equations,  and  spectral  factorization. 
Indeed,  the  factorization  is  just  a  special  case  of  spectral  factori¬ 

zation  ([You], [And]).  Sections  2  and  3  develop  the  fundamental  properties 
about  Riccati  equations  that  will  be  needed  in  Section  4  to  construct  the 
desired  factorizations.  The  material  in  Sections  2  and  3  is  for  the  most  part 
well-known  within  the  control  theory  community  ([And],[Cop], 
[Kuc],[Mat],[Moi].[Pot].[Wn]),  although  Theorem  1  of  Section  2  is  apparently 
a  somewhat  novel  description  of  the  connection  between  spectral  factoriza¬ 
tion.  Riccati  equations,  and  decomposition  of  Hamiltonians. 

Section  4  shows  that  coprime  factorizations  with  inner  numerator  and 
complementary  inner  factors  both  involve  using  a  state  feedback  or  output 
injection  matrix  based  on  a  Riccati  solution.  This  provides  a  reliable  compu¬ 
tational  method  based  on  standard  approaches  to  finding  solutions  of  Riccati 
equations  ([Pot],[Lau]).  Section  5  completes  the  solution  to  the  Hm  optimal 
control  problem  by  completing  the  parametrization  of  the  optimal  controller 
using  the  factorizations  developed  in  Section  4. 


2.3  Factorization 


2.3.1  Introduction  to  Factorization 

The  last  chapter  developed  methods  for  finding  J's  so  that  the  substitu¬ 
tion  K-Ti{J.Q)  yields 

7i(P.J0  =  TdPW.Q)) 

*  Ti(T.Q) 

-  Tn  +  TizQTu  (1) 

with  the  additional  requirement  that  T  e  Hm  and 

Fj  (P.K)  internally  stable  (2) 

i a  Q  €  H.. 


This  parametrizes  ail  stabilizing  K's  in  terms  of  a  stable  {?£/?„  in  addi¬ 
tion  to  providing  an  affine  parametrization  of  all  stable  Ti(P,K).  The  actual 
structure  of  stabilizing  parametrizations  J  was  in  terms  of  an  observer-based 
compensator.  The  stabilizing  state  feedback  and  output  injections  of  the 
observer-based  compensator  were  shown  to  provide  coprime  factorizations  of 
P  and  solve  the  Bezout  identities  necessary  to  provide  the  parametrization 
of  all  stabilizing  controllers. 


In  this  chapter,  the  requirement  is  added  that  Ta  and  Ta  be  inner,  that 


is  TaTi2~I  and  TjiTg*  =/.  In  addition,  we  find  7j_and  fj_so  that  Tjg  7jj  and 


Tz  i 

AJ 


are  square  and  inner.  This  provides  the  necessary  factorizations  to 


complete  step  2)  in  Section  2.1.5  on  the  Rational  Matrix  Generalization.  The 
final  missing  step  in  completing  the  solution  in  the  rational  case  is  to' find 
M  e  RHm  for  a  given  G  £  RLm  with  ||c|j.  <  y  such  that  e  RHm  and 


The  symbol  is  used  to  denote  this  U. 


Tiz  = 

A+BsF  B% 

Ci  +  DizF  Diz 

A+HCtiBi  +  HDii 

Tu  - 

Ca  Dzi 

Tzt  - 

3. 

In  Figure  7  the  governing  equations  are  therefore 

«  =  Tu  v  +  Tuy, 

“1  =  Tziv 
Vi  =  Quu 

so  that 


(5b) 

(5c) 


e  =  (Fn  +  TiaQTgi)v. 

In  summary,  we  have 

Theorem  3.  The  set  of  all  closed-loop  transfer  matrices  from  v  to  e  achiev¬ 
able  by  an  internally  stabilizing  proper  controller  is  equal  to 

Tu  +  TizQTzi  :  Q  C  RH -  I  +  £«£(*)  invertible j. 

The  important  points  to  note  are  that  the  closed-loop  transfer  matrix  is 
simply  an  affine  function  of  the  controller  parameter  matrix  Q  and  that  the 
coefficient  matrices  Tq  have  very  simple  realizations,  namely,  as  in  (5). 


r 


* 
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the  realization  for  G  was  assumed  minimal.  Thus,  Theorem  2.1  ensures  that 
X  -  Ric  {An)  exists  such  that  A+BF  is  stable. 

The  uniqueness  of  the  factorization  follows  from  ooprimeness  and  N 
inner.  Suppose  that  G  =  N\U\X  -  N2M£l  are  two  right  coprime  factoriza¬ 
tions  and  that  both  numerators  are  inner.  By  coprimeness,  these  two  factor¬ 
izations  are  unique  up  to  a  right  multiple  which  is  a  unit  in  RH2*m.  That  is. 


there  exists  a  unit  Q  €  RH?*m,  such  that 


0  = 


M2 

n2 


.  Clearly,  0  is  inner 


since  0*0  =  Q*N*NiQ  =  N2N2  -  I.  The  only  inner  units  in  RHm  are  constant 
matrices,  and  thus  the  desired  uniqueness  property  is  established.  Note  that 
the  nonuniqueness  is  contained  entirely  in  the  choice  of  a  particular  square 
root  of  R. 


Q.E.D. 


In  a  similar  manner  equations  (l)-(3)  can  be  used  to  obtain  the  comple¬ 
mentary  inner  factor  (£3F}  in  the  following  theorem. 

Theorem  2 : 


If  p  >  m  in  Theorem  1,  then  there  exists  a  CD*  e  RHJ***^  such 
that  the  matrix  | N  tfjj  is  square  and  inner.  A  particular  realization  is 

where  X  and  F  are  from  Theorem  1  and  XT  is  the 


A+BF 

-X*C’D  | 

C+DF 

Dl  1 

N±  = 

pseudo-inverse  of  X  . 


[Proof] : 
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The  proof  consists  of  verifying  directly  that  JVjj  is  inner  using  the 

above  realization  for  iVj_and  the  realization  for  N  from  Theorem  1.  Using  the 
notation 


1 

\a  8 

A+BF\BR~*  -tfC'Di 

N  Afjl  = 

[c  t> 

C+DF\DR~*  Dl 

(13) 


and  the  fact  that  Ker(.Y)  C  Ker^fC)  (Theorem  3.2),  equations  (l)-(3)  follow 
immediately.  Thus  |w  is  inner. 


Theorem  3 : 


There  exists  a  ref  G  -  N!d~x  such  that  M  C  RH?”*  is  inner  if  and  only 
if  G  has  no  poles  on  the  j  i^-axis.  A  particular  realization  is 


M 

N 


A+BF 

2. 

s 

F 

I 

C+DF 

D, 

where 


e 


(14) 


F  =  -B'X 

end 


(15) 


X  =  Ric 


A 

0 


-BB' 

-A' 


fc  0 


(18) 


[Proof] : 

The  proof  is  essentially  the  same  as  for  Theorem  1.  The  details  are 
straightforward  and  are  omitted. 


A. 


-  r. - a. 


i 


J§ 

t-s 
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In  the  following  theorem,  we  may  assume  that  G(s)  is  stable  without  loss 
of  generality.  Any  G  C  RLm  may  be  factored  using  the  dual  of  Theorem  3  to 
obtain  a  stable  numerator  N  such  that  N*N  =  G*G. 

Theorem  4 :  (Spectral  Factorization) 

Assume  G(s)  €  RHJ*m  and  7  >  ]|G(s)|U  Then,  there  exists  a 
M  e  RHmmxm  with  stable  inverse  such  that  M*M  =  7 */  —  G*G  with 


U  = 
where 


A 

B 

-R*KC 

R * 

R  =7*/  -  D'D  >  0 
Kc  =  —R~1(B'X  ~  D'C) 


A+BR~lD'C  -BR-'B- 
C(I+DR~1D')C  -(A+BR-'D'C)' 


if-CC  -C"X? 

( sI-A)-'B 

j[-Z7'C  R 

I 

X  =  Ric 
[Proof] : 

Let 

r  =  -f-I  -  GmG  =  jp’(-s/- 
where  R  =  y*I  -D'D. 

Since  7>  ||C|U  rO’«)>0.  The  minimality  of  the  realization  of  G(s) 

guarantees  that  (A  ,  B)  is  controllable  and  (-CC ,  A)  is  observable.  Thus, 
from  Corollary  2.1,  there  exists  M(s)  e  R£'xm‘  such  that  T  -  U*M  and  a  par¬ 
ticular  realization  is 

U  = 


A 

[b_ 

-R^-Kt 

\R* 

L 


where 


7 


K>  -  -R~\B'X-D'C) 


and 


X  *  Ric 


[C'(I+DR-lD‘)C  —(A+BR~lD'C)'\ 
Since  G  is  stable,  we  conclude  that  M  e  RHJ**m. 


Q.E.D. 


Remarks : 

(1)  The  minimality  condition  in  Theorem  3  can  be  weakened  to  ( A,B ) 
stabilizable  and  A  has  no  eigenvalues  on  the  axis  and  the 
theorem  still  holds. 

(2)  If  G  €  RHJ’*m  in  Theorem  1,  then  U  is  a  unit  in  RH-  and  JU"1  is 
"outer”.  In  this  case.  G  -  N(Ii~l)  is  called  "inner-outer  factoriza¬ 
tion"  (lOF). 

(3)  Dual  results  for  all  factorizations  can  be  obtained  when  p  sc  m.  In 
these  factorizations,  output  injection  using  the  dual  Riccati  solution 
replaces  state  feedback  to  obtain  corresponding  left  factorizations. 

(4)  The  symbol  (7 a/  -  G*G)^  will  be  used  to  denote  the  M  in  Theorem 
4. 


-• 


2.3.5  Parametrizing  the  Optimal  Controller 


This  section  combines  the  results  of  Youla’s  parametrization  and  the 
coprime  factorization  to  parameterize  all  stabilizing  controllers  in  a  way  that 
is  convenient  for  solving  optimal  Lg  and  Lm  control  problems.  This  completes 
the  rational  generalization  of  the  constant  matrix  case  as  outlined  in  Section 
2.1.5.  The  only  remaining  step  is  a  standard  Hm  approximation  problem. 

Let 


\Pu  Pa 

A 

By  Bg 

P  =  [Pti  P* 

s 

Cy 

Dyy  Dyg 

Cg 

Dgy  Dzg 

(1) 


Suppose  that  neither  Pa  nor  Pgy  has  transmission  zeros  on  the  jo-axis 
(including  “  )  and.  without  loss  of  generality.  Dyg7Dyg  -  I  and  DzlDg7  -  I. 
Under  these  assumptions,  let  Dj_-  (Z5ia)j_  and  £j_=  that  is,  |i?i2  X>j| 

and  ^]DgiT  3jfj  are  orthogonal  matrices.  Then,  factor  P  as  before  with  F 
and  H  given  as  follows  : 


F  =  HDygTCy  +  BgrX) 

A  ~~  BgDyg7  Cx  —BgBg7 

-  Cy7D$[ Cy  -(A -BgDXgrCy)r: 


X-  Ric 


(2) 


and 


H  =  HByDgy7  H-  YC,7) 

\(A-ByDgyTCg)T  -Cg7Cg 
r=RlCl  - ByDfDiBy 7  -{A -ByDgy7Cg)\ 

Then,  NygmNyg  -  I  and  NgyNgy*  -  /.  Also,  let  ^and  ^j_be  CIF's  so  that 


(3) 
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*1 1  B 12  *12  j  r^.  ] 

where  R  =  ^  =  [//^j  (^V)n  [*2i*  *i*|. 


(9) 


The  a  -  2  case  is  particularly  simple  and  it  possible  to  get  explicit  for¬ 
mulas  for  the  optimal  controller.  Since 

1H 

(10) 


j  1  /?u  t  ^  /?  ia 

1 

1 1 

2 

0 

2 

III  #81  R& 

L 

P?n+« 

+ 

2 

Rgg 

2  , 

the  optimal  Q  is  seen  immediately  to  be 

To  obtain  an  explicit  expression  for  Q^,  we  need  to  compute  R. 

«N* 

^  .  It  is  convenient  to  compute 


Note  that  (JVV)n  *  |*n  R\z 


*12* 

*12**»  / 

*1 

*11  *12 

*£  0 

and 


r*- 

i'll 

W 

1*21*  *1 

seperately. 
Claim  1 : 


[*ia* 


M = 


-(A^BtF)7 

(fi*  DigF)TDn  +  XB , 

-BZT 

B\tT  B\\ 

Dfcxr 

DfDn 

(id 


(12) 


(13) 


(14) 


iff 


(21) 


+  $!|-  «  1  (22) 
where  G=Rn/J~l  E  RHm  and  Q=QM~l.  Solving  (22)  for  $  E  RHm  solves  (20) 
for  Q  £  RHm.  Note  that  Q-QH  is  in  RHm  if  $  is,  since  H  £  RHm  by  construc¬ 
tion. 

The  final  step  in  the  rational  case  then  involves  solving  (22)  for  $  £  RHm. 
This  is  a  standard  mathematical  problem  of  approximating  an  Lm  matrix  by 
an  Hm  matrix 


3.1  LSS  Control  Perspective 


This  part  will  outline  the  main  issues  Involved  in  applying  the  methods  of 
the  previous  parts  to  large  space  structure  (  LSS  )  control  problems.  The 
most  fundemental  issue  is  how  the  performance  specifications  and  uncer¬ 
tainty  can  be  put  into  the  general  framework  developed  in  Part  1.  For  the 
most  part.  LSS  control  problems  involve  the  usual  problems  of  providing 
tracking  and  disturbance  rejection  in  the  presence  of  plant  uncertainty  and 
noise. 

A  significant  distinguishing  feature  of  LSS  control  seems  to  be  that  there 
is  a  great  deal  of  structure  associated  with  the  plant  uncertainty.  On  the  one 
hand,  this  suggests  that  the  methods  of  the  previous  sections,  which  focus 
attention  on  structured  uncertainty,  are  uniquely  suited  to  handle  LSS  con¬ 
trol  problems.  On  the  other  hand,  it  will  require  significant  application- 
specific  modeling  and  analysis  to  exploit  the  known  structural  properties  of 
LSS  uncer  tainty. 

The  remainder  of  this  section  will  outline  the  main  issues  in  control  of 
LSS’s,  witn  emphasis  on  those  associated  with  large  antennas.  The  discussion 
will  be  k*pt  as  elementary  as  possible,  and  will  avoid  the  use  of  the  more 
advanced  of  the  mathematical  techniques  developed  in  the  previous  parts  of 
these  notes.  With  this  as  a  foundation,  the  next  section  will  discuss  specific 
modeling  issues  and  how  they  relate  to  the  methods  of  the  rest  of  these 
notes. 

From  a  controls  perspective,  the  problem  is  to  achieve  the  specified 
mission  performance  from  an  incompletely  known  spacecraft  in  the  face  of 
uncertain  disturbances.  Such  a  problem  invariably  requires  the  use  of  feed¬ 
back.  Because  the  effect  of  feedback  is  critical  to  spacecraft  performance  a 
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review  of  the  feedback  fundamentals  relevant  to  this  problem  is  in  order.  By 
examining  the  properties  of  feedback  in  a  general  setting,  without  regard  to 
the  technique  used  to  generate  the  feedback  law,  fundamental  relationships 
between  performance,  robustness  and  feedback  control  loop  properties 
become  evident.  This  will  make  it  easier  to  appreciate  the  role  that  the 
specific  methods  developed  in  the  preceding  sections  can  play  in  analysis 
and  syn thesis  of  controllers  for  large  space  structures. 

Feedback  Improves  Performance 

A  typical  feedback  situation  for  a  space  pointing  mission  is  shown  in  Fig. 
3*1.  Here  we  have  disturbances  acting  on  a  structure  to  upset  the  primary 
goal  of  the  mission,  namely  to  keep  iine-of-sight1  (LOS)  errors  smaller  than  a 
specified  level,  e, 


j  LOS  -  LOSe  I  s  e. 

It  is  assumed  that  measurements  of  LOS  are  imperfect  due  to  sensor 
dynamics,  sensor  noise  and  (possibly)  the  need  to  infer  LOS  from  related 
measurements.  In  typical  feedback  fashion,  the  sensed  response  is  com¬ 
pared  to  the  desired  or  commanded  value  and  the  error  used  by  a  controller 
to  generate  a  control  signal  to  drive  the  actuators  on  the  structure. 

Each  of  the  elements  cf  this  loop  can  be  represented  by  its  transfer 
function  (TF)  relating  the  Laplace  transform  of  the  response  to  the  Laplace 
transform  of  the  forcing  function  The  error  response  of  the  structure  to 
commands,  LOSe  disturbances,  D,  and  sensor  noise,  ,V,  can  then  be 
developed  by  standard  feedback  equation  manipulation.  Letting 

‘For  simplicity  of  exposition  in  this  section  we  use  the  terra  Iine-of-sight  (LOS)  genertcaLy 
to  refer  to  ell  controlled  variables  of  interest  —  10S,  wavefront,  etc. 


D0  =  G$D  -  disturbance  as  seen  at  the  output  LOS  in  the  absence  of 
feedback 

L  =  KGT  —  feedback  loop  transfer  function  (i.e.,  transmission  around 
the  feedback  loop) 

and  AT  =  Ttn*  -  T  -  LOS  sensor  uncertainty 


we  get 


(LOS  -  L0Se)  = 


-  LGSe ) 


l  ](Af  +  ATLOSc) 

1+Zj  T 


This  equation  relates  system  performance  to  each  error  source.  Its 
various  terms  can  be  interpreted  in  many  ways,  such  as  amplitudes  of  sine 
waves  or  as  signals  in  £a.  In  either  case,  four  immediate  consequences  can 
be  s  ?en: 


Cbnseqtience  No.  1  —The  loop  transfer  function  must  be 
large  to  achieve  small  errors.  This  follows  from  the  first 
term  on  the  right  hand  side  of  the  error  equation.  In  fact, 
in  order  to  meet  our  specified  error  level,  we  must  have 

e  |  1-fL  >  J  D,  -  LOSg  | 

Thus,  at  those  frequencies  where  either  the  disturbance 
responses  or  the  commands  are  large  compared  with  t,  we 
require  Jij  »  1. 

Consequence  No.  2 —Sensor  noise  must  be  small  enough. 
This  follows  from  the  second  term,  which  requires  that 


Figure  3-3.  Suitable  Control  Solution 


igure  3-4.  Effect  of  Spacecraft  Flexibility  on  LOS  Contro 


NO  FCCOBACK  '  FEEDBACK 


f  »i  ■  !  ■  u  ■  .  ■  g  ■■■■.■  •:  -  w  Tg  ■*  7^:  ■■v  *  ■  -jt,-  w  '  v — tt^T. 
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seem  to  be  two  obvious  alternatives  to  this  direct  approach.  The  most  desir¬ 
able  method  would  be  to  actually  check  for  the  robustness  with  respect  to 
the  real  parameter  variations  along  with  those  modelled  as  complex.  This 
clearly  involves  a  generalization  of  the  existing  SSV  and  no  reliable  method 
exists  for  computing  such  an  object  in  any  but  the  most  simple  cases. 

A  more  promising  alternative,  at  least  in  the  near  term,  seems  to  be  to 
replace  the  effect  of  the  real  parameter  variation  with  a  complex  one.  That 
is,  instead  of  using  the  same  linear  fractional  transformation  on  the  A  and 
simply  replacing  the  real  parts  of  A  with  complex  perturbations,  the  linear 
fractional  transformation  is  changed  as  well  so  that  the  set  of  plants 
described  by  the  new  representation  closely  matches  the  original  set  of 
plants.  Preliminary  investigations  of  this  approach  appear  promising,  but  no 
systematic  procedure  has  been  developed  for  achieving  this  model 
simplification. 

Both  of  these  approaches  to  the  real  parameter  robustness  problem 
have  been  investigated  and  will  continue  to  be  important  research  directions. 
The  results  available  to  date  are  rather  fragmented  and  incomplete.  While 
we  have  had  remarkable  success  in  applying  the  SSV  to  several  example 
problems  (including  the  space  shuttle)  involving  real  parameter  variations, 
the  details  of  the  techniques  were  somewhat  problem-specific.  It  is  expected 
that  a  more  coherent  view  of  the  real  perturbation  problem  will  emerge  in 
the  next  year. 


A 


noise  model,  can  be  rearranged  to  fit  the  general  framework  of  Part  I,  shown 
in  Figure  13.  The  major  sources  of  uncertainty  in  this  model  are  summarized 
in  Tables  1.2  and  3.  In  these  tables  "external"  and  "internal"  are  used  to 
describe  disturbances  that  are  generated  physically  either  outside  or  inside, 
respectively,  of  the  spacecraft  In  the  sense  of  Part  I.  all  these  disturbances 
are  external. 

It  is  clear  that  the  model  depicted  in  Fig.  13  will  have  a  great  deal  of 
structure:  the  A  will  have  many  blocks.  Thus,  the  methods  of  Part  I  using  the 
SSV  are  clearly  going  to  play  a  critical  role  in  the  analysis  and  synthesis  of 
controllers  for  LSS’s.  It  is  unlikely  that  it  will  be  possible  to  reliably  handle 
the  inherent  uncertainties  of  LSS  control  using  the  paradigm  of  either  sto¬ 
chastic  optimal  control  or  standard  robust  multivariable  control  theory 
based  on  singular  values. 

On  the  other  hand,  the  application  of  SSV  techniques  to  LSS  control 
appears  to  be  a  challenging  problem.  The  large  number  of  blocks  in  a  rea¬ 
sonable  perturbation  model  of  a  LSS  will  stress  the  current  experimental 
software,  so  improvements  in  computation  and  approximation  methods 
would  be  desirable.  Another  critical  issue  is  that  a  naive  application  of  the 
SSV  to  Fig.  13  for  large  space  structures  could  result  in  very  pessimistic 
results.  The  reason  for  this  is  that  the  natural  way  to  represent  uncertain 
structural  mass,  stiffness,  and  damping  characteristics  is  with  real  parame¬ 
ter  variations.  Thus,  many  of  the  blocks  In  Fig.  13  would  be  real,  and  applica¬ 
tion  of  the  SSV  would  actually  involve  checking  robustness  characteristics 
with  respect  to  complex  variations. 

Simply  replacing  real  perturbations  with  complex  ones  of  the  same  mag¬ 
nitude  would  typically  be  very  conservative  in  LSS  control  problems.  There 


"pseudo"  truth  model,  which  we  recommend,  is  to  introduce  perturbations  of 
appropriate  magnitude  in  ail  uncertain  parameters  of  the  evaluation  model. 

Disturbance /  Command  Models— The  most  critical  disturbances  for  control¬ 
ling  large  space  antennas  are  likely  to  be  high-frequency  vibrations  due  to 
CMGs  and  other  rotating  machinery,  plus  distributed  forces/torques  due  to 
this  same  equipment,  coolant  turbulence,  etc.  These  may  be  approximated 
as  (1)  discrete-frequency  components,  provided  a  sufficient  number  of  har¬ 
monics  are  included  and  uncertainty  in  each  of  the  frequencies  is  accounted 
for  and  (2)  broadband  disturbances.  These  latter  disturbances  will  likely  be 
dominated  by  the  large  command  requirements  for  slew  maneuvers. 
Bandwidth  of  these  command  requirements  will  be  minimized  by  the  use  of 
appropriate  command  shaping  where  applicable.  Although  the  numbers  will 
change,  the  general  characteristics  of  these  disturbances  is  illustrated  in  the 
plot  of  Fig.  3-10. 

Sensor/ Actuator  Models—  Unear  models  for  sensors  and  actuators  will 
include  all  relevant  dynamics  for  frequencies  out  to  roughly  a  decade  beyond 
the  desired  control  loop  gain  crossover.  The  effects  of  unmodeled  high- 
frequency  dynamics  and  any  significant  nonlinearities  can  be  approximated 
using  the  frequency-dependent  multiplicative  uncertainties  illustrated  in  Fig. 
3-11  and  3-12. 

Analysis  Issues 

Figures  3-9,  3-11,  and  3-12  can  be  combined  to  obtain  the  desired  nomi¬ 
nal  plus  perturbation  model  of  the  LSS  that  is  required  to  use  the  analysis 
tools  of  Part  I.  This  perturbation  model,  together  with  a  disturbance  and 


G(Ju)  £  A(*)C(bf 

where 

*<I)S 

This  model,  as  usual,  admits  a  state-space  representation  of  the  form 


X  -  AX  -r  BM 


y  =  Cx  ~  Du 
where 


A  -  diag 

B  -  col 

C  =  row 
C  =  0 


0  1 
“« t2  “2ft  «t 


0 

M 
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Because  the  truth  model  is  not  available  to  the  designer,  control  designs 
developed  using  the  design  model  can  only  be  directly  verified  using  the 
evaluation  model,  which  accounts  only  for  the  known  (but  largely  unstruc¬ 
tured)  errors,  &G,(ju).  Thus  he  must  "design"  to  accommodate  both  these 
known  errors  as  well  as  unknown  errors,  which  Include  both  structured 
errors  (AA.  ixB,  AC)  and  largely  unstructured  errors,  A £»(?«)•  It  is,  of 
course,  possible  to  define  a  "pseudo"  truth  model  by  deliberately  introducing 
perturbations  in  the  structural  data  used  to  define  the  finite-element  evalua¬ 
tion  model,  as  was  done  for  some  of  the  ACOSS  studies.  This  approach,  unfor¬ 
tunately,  is  prone  to  making  the  same  approximation  errors  in  the  "pseudo" 
truth  model  as  exist  in  the  evaluation  model.  A  safer  approach  to  defining  a 
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finite-element  modeling  technique  (e.g.,  NASTRAN).  It  has  the  form 


"» 

<%(»)  I  (Sigrt(a)ea(b£ 

where 


9,\(s)  £ 


_ 1 

s2+2  t«u«s+u& 


i=l,2 . n. 


Note  that  this  model  differs  from  the  truth  model  in  two  respects.  First,  it  is 
of  lower  (finite)  order.  This  difference  can  generally  be  approximated  as  a 
bounded  frequency-dependent  multiplicative  error,  A flj,  which  is  reflected 
either  to  the  input  or  output  (as  illustrated  in  Fig.  3-9)  of  the  system.  The 
second  difference  is  a  more  systematic  one  -  the  model  parameters  are 
different.  This  is  due  to  a  number  of  factors  -  poorly  known  structural  pro¬ 
perties.  too  coarse  a  finite-element  grid,  nonlinearities,  etc.  Such  errors  are 
highly  structured  and  generally  cannot  be  reflected  to  either  the  input  or 
output  of  the  system,  without  substantially  increasing  the  corresponding 
bounds  for  these  errors. 


Th£  third  model,  or  design  model,  is  a  lower-order  approximation  to  the 
evaluation  model,  which  is  used  for  control  design.  It  is  derived  from  the 
evaluation  model  by  means  to  be  described  shortly.  Since  it  is  used  for  con¬ 
trol  design,  it  should  not  differ  greatly  from  the  evaluation  model,  except  at 
high  frequencies  well  beyond  desired  control  loop  gain  crossover.  These 
differences  can  generally  be  represented  by  the  bounded  frequency- 
dependent  multiplicative  error,  AC,,  illustrated  in  FLg.  3-9.  Depending  on 
control  requirements,  this  model  may  sometimes  differ  also  at  low  fre¬ 
quency.  The  design  model  has  the  general  form 


3.2  Modeling  and  Analysis  Issues 

We  next  examine  models  and  model  uncertainty  for  critical  elements  of  the 
control  design  problem,  and  to  see  how  the  LSS  control  problem  fits  into  the 
framework  of  Part  I. 

Modal  Definition 

Spacecraft  Models  -To  properly  motivate  the  issue  of  model  error,  we  define 
three  spacecraft  models,  which  are  illustrated  in  Fig.  3-9.  The  first  model,  or 
truth  model,  is  supplied  by  "Mother  Nature."  It  is  an  infinite-dimensional 
model  which  can  be  represented  in  the  form  (assuming  either  position  or 
attitude  outputs  ond  force  or  torque  inputs) 


Gt(s)  &  £  $N(s)c«b£ 


where 


9ti(s)  i  •  1 - j-  i=l,2, 

s2+Z^tiutis  ruj 


Here  and  denote  frequency  and  damping  for  the  i*  mode  while  bM  and 
cH  denote  input  and  output  influence  coefficient  vectors  for  ail  relevant  input 
and  output  nodes.  Line-of-sight  and  wavefront  errors  can  also  be  expressed 
in  this  same  form  by  appropriate  definition  of  the  cM's,  as  can  disturbance 
inputs  by  appropriate  definition  of  the  b«‘ s.  Rate  and  acceleration  outputs 
can  be  handled  by  introducing  s  and  s8  numerator  terms  to  the  p«’s. 


vn 


The  second  model,  or  evaluation  model,  is  a  high-order  finite-dimensional 
approximation  to  the  truth  model,  which  is  typically  derived  from  some 
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Note  that  the  plot  in  Fig.  3-7  provides  explicit  requirements  for 
vibration  control.  For  example,  the  important  controlled  modes 
are  identified  as  those  above  the  e-line.  Also,  the  amount  of  damp¬ 
ing  each  controlled  mode  requires  is  clearly  shown.  Finally,  the 
effect  and  adequacy  of  any  structural  solutions,  such  as  stiffening, 
can  be  clearly  assessed. 

Class  IV:  Integrated  Painting  and  Vuiratian  Control— As  the  e-line 
moves  still  lower,  it  eventually  lies  below  the  average  response  of 
the  flexible  structure.  Damping  augmentation  alone  is  no  longer 
sufficient.  In  fact,  the  LOS  pointing  controller  must  now  of  neces¬ 
sity  provide  active  feedback  control  in  the  flexible  frequency  region 
as  shown  in  Fig.  3-9.  The  significant  feature  of  this  problem  is  that 
the  LOS  itself  must  now  be  measured  accurately  (or  inferred  from 
related  measurements)  up  to  very  high  frequency. 

Applicability  to  Large  Space  Antennas 

In  view  of  the  inevitable  overlap  between  structural  mode  frequencies  and 
control  bandwidth  required  to  meet  line-of-sight  and  antenna  surface  shape 
control  requirements  in  the  face  of  high  frequency  vibration  disturbances 
and  high-bandwidth  slewing  command  requirements,  the  large  space  antenna 
control  problem  falls  into  Class  IV.  Thus  we  now  examine  modeling  and  con¬ 
trol  design  issues  that  are  relevant  to  this  class  of  problems. 
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increase,  we  find  that  some  high-frequency  modes  exceed  the  e- 
line.  This  is  illustrated  in  Fig.  3-6.  If  high  frequency  disturbances 
are  generated  internally,  however,  it  is  often  possible  to  isolate 
them  (e.g..  shock-mounted  CMGs).  This  decreases  D ,  and 
reduces  the  problem  to  Class  I,  for  which  solutions  are  known. 

Class  III :  Attitude  Control  with  Isolation  and  Vibration  Control— 
There  are  limits,  however,  to  how  much  isolation  can  be  provided. 
One  of  the  major  limitations,  for  example,  occurs  when  the  LOS  con¬ 
troller  must  act  through  the  isolator  to  point  the  payload  (as  was 
true  for  the  ACOSS  II  model).  Rigid  body  stability  requirements 
then  impose  a  lower  limit  on  isolation  frequency.  For  example, 
assume  we  can  tolerate  no  more  than  45  degrees  of  phase  shift 
from  the  isolator  at  the  LOS  controller  crossover  frequency.  y0. 
Furthermore,  assume  we  wish  to  use  a  passive  second-order  isola¬ 
tor.  Then  the  Bode  gain/phase  relations  require  that 

Mfcs  %  12.5ue 

where  is  the  lowest  permissible  frequency  of  the  isolator.  Simi¬ 
lar  calculations  can  be  made  for  other  situations  such  as  active  iso¬ 
lation  or  higher-order  passive  isolation. 

Thus,  as  disturbances  increase  or  specifications  get  tighter,  we 
must  eventually  face  feedback  control  of  flexible  modes.  This  is 
illustrated  in  Fig.  3-7.  If  only  resonant  peaks  lie  above  the  e-line,  it 
is  sufficient  to  smooth  the  disturbance  response  to  more  closely  fol¬ 
low  the  average,  shown  dotted  in  Fig.  3-7  to  return  to  the  conven¬ 
tional  problem.  This  is  exactly  what  vibration  damping  does. 


? 


Note  that  1  and  2  together  allow  for 
bandwidths  in  terms  of  mission  specs 


< 


D„  -  LOSe 


j)  alone. 


a-priori  assessment  of  component 
(e)  and  disturbance  environments 


The  concepts  discussed  above  apply  to  flexible  spacecraft  as  well  as  rigid.  We 
merely  include  the  effects  of  flexibility  in  calculating  the  open  loop  distur¬ 
bance  responses.  Figure  3-4  is  an  example.  Again,  the  e-line  is  introduced  to 
determine  the  frequencies  where  feedback  control  is  necessary.  Now,  how¬ 
ever.  we  also  introduce  a  second  line,  a  vertical  line  at  u  -  «j,  which  serves 
to  partition  the  frequency  range  into  rigid  and  flexible.  Together  these  two 
lines  serve  to  categorize  the  control  problem. 


Control  Classes 

The  control  problem  can  be  categorized  according  to  where  the  e-line  and 
the  first  flexible  mode  flex-line  occur  relative  to  the  disturbance  responses. 
These  categories  describe  classes  of  pointing  control  problems  with  different 
degrees-of-difficulty  and  different  solution  approaches.  Note  tht  these 
classes  are  unrelated  to  those  of  Part  1. 


Class  I:  Conventional  Attitude  Control— When  all  disturbance  lim¬ 
its  to  the  right  of  the  flex-line  remain  below  the  e-line  (as  illustrated 
in  Fig.  3-5),  no  feedback  control  of  the  flexible  modes  is  required. 
We  then  have  a  conventional  rigid-body  design  problem  which  is 
routinely  solved  on  current  generation  spacecraft 

Class  II:  Attitude  Control  with  Isolation-lt  either  the  LOS 
specification  gets  tighter  (e-line  decreases)  or  the  disturbances 
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is  best  understood  graphically  by  plotting  the  right  hand  side,  |  jU0  -  LOSt 

of  Equation  (1).  Such  a  plot  is  shown  for  a  rigid  spacecraft  and  for  typical 
disturbances  in  Fig.  3-2  on  a  log-log  scale.  For  example,  solar  torques  cause 
open-loop  LOS  deflections  near  the  orbital  frequency.  CMG  rotor  imbalances 
cause  LOS  errors  at  higher  frequencies.  All  such  disturbances  can  be  pulled 
together  on  the  same  plot  according  to  their  effect  on  the  LOS  output  (Le., 


j).  In 


addition,  the  command  spectra  can  be  included  as  the  transform 


of  the  desired  response  to  LOS  commands.  Together  these  terms  give 

Ik  -losAI. 


Superimposed  on  Figure  3-2  is  an  e-line  which  defines  the  allowable 
error.  Consequence  No.  1  can  be  used  to  determine  the  requirements  on  L 
In  particular,  for  those  parts  of  the  figure  which  lie  below  the  e-line,  no  feed¬ 
back  is  required,  Le.,  j  |  L  j  «  1  j  at  those  frequencies.  However,  for  those 
parts  which  lie  above  the  line,  we  require  SL  I  L  j  >  1  in  order  to  ensure 
reduction  of  errors  to  the  e-level  in  the  closed  loop. 


These  observations  suggest  that  Fig.  3-2  can  be  used  to  graphically  deter¬ 
mine  the  suitability  of  any  L,  and  thus  of  any  compensator  K,  by  directly 
sketching  {  1+L  j.  Such  a  plot  is  shown  as  Fig.  3-3.  We  observe  that: 


1.  It  is  desirable  to  have  the  sjz.  S  1+L  j  li-e  follow  as  closely  as  possible  to 
the  disturbance  limits.  This  minimizes  control  authority. 

2.  The  point  where  sjl  J  1+L  J  =  s  is  the  crossover  .frequency,  ue .  All 
hardware  components  involved  in  this  loop  must  have  bandwidths  higher 
than  this  frequency. 
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In  the  minds  of  most  engineers,  these  fundamental  consequences  of  feedback 
are  associated  with  the  classical  single-input  single-output  feedback  theory. 
It  is  clear  from  Part  1  of  these  notes  that  they  are  equally  true  for  more 
complex  multi-input  multi-output  situations.  In  fact,  this  has  been  made 
clear  by  much  earlier  research  at  Honeywell  on  extensions  of  classical  con¬ 
cepts  to  multivariable  problems  using  singular  values.  This  corresponds  to 
the  case  2  framework  described  in  Part  1.  Using  this  framework  ,  Conse¬ 
quences  No.  1  through  No.  4  can  be  restated  for  the  multivariable  case  as  fol¬ 
lows: 


1.  +l|>  \d,  -  LOS9  | 

2.  ||  T~lN  j|  <  e 


Here  5  $  •  j  and  &.  j  •  J  are  the  maximum  and  minimum  singular  values  of  the 
indicated  matrices  and  jj*||  represents  the  magnitude  (norm)  of  the  indi¬ 
cated  vector.  V  {  •  J  can  be  interpreted  as  the  maximum  gain  which  the 
matrix  can  produce  (at  a  frequency)  and  &  can  similarly  be  interpreted  as 
the  minimum  gain.  This  point  of  view  is  discussed  in  more  detail  in  [DSt2]. 


Graphical  Interpretation 

There  is  no  need  to  delve  into  the  intricacies  of  matrix  theory  in  order  to 
understand  the  fundamentals  of  feedback.  Consequence  No.  1,  for  example, 
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<  e  whenever  jfij  »  1 

Consequence  No.  3  -Sensor  uncertainties  (errors  in  T) 
must  be  small  enough.  This  also  follows  from  the  second 
term,  which  requires  that 

<  i — - - whenever  !z>|  »  1 

LOS„  1  1 

This  result  specifies  the  uncertainty  which  can  be 
tolerated  in  the  LOS  derivation  at  each  frequency.  In 
essence,  it  establishes  requirements  for  alignment  and 
calibration  of  the  LOS  reference  system. 

Consequence  No.  4 -Model  uncertainties  (errors  in  L) 
must  be  small  enough.  This  follows  from  both  terms.  Let¬ 
ting  errors  in  L  be  AL,  these  terms  require  that 

1  +  L  +  AL  0 

for  ail  frequencies  and  all  AL.  One  way  to  ensure  that  this 
is  true  is  to  require 

AL  |<  ]  X  *  L  | 
for  all  frequencies  and  all  AL. 

This  is  a  statement  of  the  robustness  requirement  of  feed¬ 
back  systems  and  quantifies  the  amount  of  uncertainty 
which  can  be  tolerated  without  loss  of  stability. 

Multi-Inputs  and  Multi-Outputs 


Figure  3-7.  Class  III:  Attitude  Control  with  Isolation 
and  Vibration  Control 


w,  w,  log  w 

Figure  3-8.  Class  IV:  Integrated  Pointing  and  Vibration  Control 
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Figure  3-10.  Disturbance/Command  PSDs  (VCOSS  Studies) 
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Figure  3-11.  Actuator  Model  with  Unstructured  Uncertainty 
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Figure  3-12.  Sensor  Model  with  Unstructured  Uncertainty 


Table  L  LSS  Uncertainty;  E&erral  Disturbances  and  Noises 


Source 

Frequmcy  Range 

Cccmants 

Aerodynamic 

Low  (Orbit  Rate) 

Significant  f  cr  low-altitude  orbits,  h  <  500  km 
(Doninant  "external  tisturbence  for 

h  <  250  fcm) 

Soler 

Low  (Orbit  Rate) 

Independent  of  altitude  (Dominant  "external" 
disturbance  at  geosynchronous  altitude) 

Gravity  Gradient 

Low  (Orbit  Rate) 

M  ost  significant  at  low  altitude;  but  relatively 
week  for  h  <  R, 

i 

Earth  M  agnetic 

Low  (Orbit  Rate) 

1 

Most  sgraflcent  at  low  altitude  but  relatively  J 
weak  function  of  altitude 

Thermal 

Low  (Ortit  Rate) 

Independent  of  altitude 

Sours 


Frequsxy  Range 


Cammts 


i ... . ';/.jijM.j5a:a^-»iaaai-.asa 

Jet  Imbalances 

Mid.  (Steps) 

Solar  Panel  Drive 

Mid  (Step) 

• 

Sensor/A  ctuator  Errors 

Low-Mid 

Mid-High 

1 

1 

! 

i 

1 

All 

Vibration 

Mid-Kigb  Narrow 

Bend 

> 

Typically  the  dominant  'Internal"  disturbance 


Attitude  Reference  System 
Bases/Alignment/Noise 


Rate  Gyro  Noise/D  rift 


Reaction  If  heel/Control  U  ament  G  yro 


Reaction  W  heels  / Control  Moment  Gyros 
(snail} 

Cryogenic  Coolers  large  when 
High-energy  Weapons}  present 


Table  3.  LSS  Uncertainty.  Perturbaiions 


Source 

Frequency  Range 

Cm  mails 

Acbuatare/Sensore 

High 

Unknown  or  apcrcodnated  high-frequency 
dynamics  Often  modelled  as  /  +  A 

Structure 

All 

Uncertain  Maes,  Stiffness  Damping.  Two 
modelling  options: 

1)  Directly  as  parameter  variations 

2)  Indirectly  as  uncertain  modal  frequency, 
damping,  influence  coefficients 

Truncation 

High 

Neglected  dynamics,  many  modelling  options 
primariy  unstructured 
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